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Preface 


It is the task of the historian to give back to the past its sense 
of the future. 
—unattributed quote. 


This book is based on a course on the history of mathematical analysis that I taught 
for four years to third- and fourth-year undergraduates at the University of Warwick 
from 2005 to 2008. It concentrates on the history of topics in analysis, starting with 
three topics that were active around 1800 and it stops just short of Lebesgue’s 
introduction of measure theory. 

It is a historical approach, concerned with how ideas came about, what problems 
people were addressing, why they thought these ideas would help, and what they 
made of what they discovered. Very often, the mathematicians were surprised and 
puzzled too. The thread to follow is what interested them, not to get lost in the 
details of mathematics, which can indeed be unclear. That said, many of these 
topics are the staple ingredients of courses in mathematical analysis. Much of this 
material in this book illuminates this classical analysis by showing what can go 
wrong and so motivates the need for carefully worded theorems. But such examples 
should not be regarded as unwelcome guests at the feast, traps for the unwary, and 
certainly not as merely providing ‘catch questions’ for problem sheets and exams. 
Rather, they are examples of what can happen, clues to what else is to be dis- 
covered, stages on the endless path to giving a correct and comprehensive account 
of a subject. And, as this book hopes to make clear, a certain degree of insight is 
needed to appreciate which are the important ones, the fruitful examples to pursue, 
and we shall see that mathematicians have legitimately differed about this. 

While giving the course I was frequently reminded of one of the most striking 
problems involved in teaching mathematics: indicating how valuable new mathe- 
matics is possible and actually gets done. A student in any of the sciences may 
easily enter university with some idea of what is happening in research in their 
subject, but despite the growing numbers of very welcome popularisations of 
mathematics in recent years it remains almost impossible to bring, shall we say, any 
of the Clay Problems to an audience of beginners (with the possible exception 
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of the P versus NP problem). The problem is especially acute with the purer and 
less utilitarian branches of mathematics. 

It is my belief that working mathematicians assimilate a certain body of 
knowledge and make it their own. They know how to use it to tackle certain 
problems, but they also have an idea of what they would like to know but do not. 
With some further work they may form a programme for getting there, conjecture a 
useful interim result, discover a misunderstanding, even a mistake in what they 
thought they knew. The attempt to obtain new results shapes their sense of what 
they know and what they would like to discover: a lemma, an estimate, an example, 
a counter-example, perhaps even the long sought-for result. I have tried in this book 
to convey my sense of how the analysts of the nineteenth century found their ways, 
imperfectly, from what they knew to what they wanted to know. 

I found it curiously helpful to use Maple to generate pictures of functions. These 
graphs are not proofs, they may well be misleading, but they are suggestive, and 
they provide a way of looking carefully at the functions themselves. I hope they 
form a bridge between the student’s understanding and that of the well-educated 
mathematician of more than a hundred years ago. 

I have not respected the modern distinction between real and complex analysis 
for the simple reason that it did not operate in the nineteenth century. Indeed, as will 
be explained, neglecting it proved to be one of the sources of one of the finest 
discoveries of real analysis: functions that are continuous on an interval and dif- 
ferentiable nowhere (see Sect. 20.4). 

It is my hope that this book will be part of a four-volume series, the first volume 
of which, Worlds out of Nothing (2011), covers topics in the history of geometry in 
the nineteenth century, the third, now almost completed, will be on algebra in the 
same period, and the final volume of which will be on the history of differential 
equations. Like the first book, this was originally a course of 30 lectures, structured 
around three assessment points. I do not want to repeat the advice about writing 
essays that I gave in the earlier book, although I stand by it, but I have taken the 
opportunity to treat three chapters as an opportunity not only to look back at what 
has been discussed but to think about how it can be described and to address the 
student more directly. The point at issue is the difference between a list of facts and 
a good essay. A good essay is an argument; it expresses a point of view which it 
supports with evidence. It is attentive to other points of view, refuting them as 
necessary. It is an attempt to persuade. 

There can be many different but equally good answers to an essay question. This 
depends partly on who forms the intended audience, partly on the author’s sensi- 
bilities, partly on the space and time available. As a teacher I favour the short essay 
because it is hard: it is difficult to select the most important items, relatively easy to 
reply with ‘everything’. It is, of course, difficult as a teacher not to reward students 
who uncritically but astutely follow the ‘party line’, especially when, as here, the 
author has strong opinions. To that end students should be encouraged to track 
down other books in their university library, and I have made some suggestions as 
this book proceeds. 
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A problem with teaching the history of the mathematics of the nineteenth 
century to an English-speaking audience is that so little was written in English. In 
an Appendix I have given my own translations of extracts from a paper by Dirichlet 
on Fourier series, papers by Riemann introducing complex function theory and the 
Riemann integral, and the paper by Schwarz where he mapped a circle analytically 
onto a square. An extract from the English translation of Fourier’s account of 
Fourier series is also supplied. A small number of other sources are now available 
in English including the collected works of Riemann, which is very welcome. 
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The first three chapters set the scene by describing Lagrange’s attempt to provide 
rigorous foundations for the calculus, Fourier’s introduction of the series now 
named after him, and Legendre’s work on elliptic integrals. All these topics are 
sources for much of the subsequent development, although only the second is likely 
to seem familiar. But Lagrange’s ideas set the scene for Cauchy’s, and the topic of 
elliptic integrals is not hard, was obviously important by the standards of 1800, and 
has surprising later ramifications. 

The scene being set we now meet Cauchy, with two chapters on his work on real 
analysis and one on the much more confused state of his work in the 1820s and 
1830s on complex analysis. The marked difference in his understanding was mir- 
rored by the circumstances of publication, with important implications for decades. 
Three chapters follow on how Abel, Jacobi, and Gauss made complex elliptic 
functions out of real elliptic integrals, which I argue was one of the most important 
sources of subsequent interest in complex analysis (and more visible than Cauchy’s 
work). Then come two chapters bringing elliptic functions and complex functions 
together, and we can pause to take stock. 

The next part of the book picks up the topic of potential theory and the con- 
tributions of Gauss, Green, and Dirichlet, before turning to the topic of Dirichlet’s 
work on Fourier series and Riemann’s work on trigonometric series. Then come 
several chapters on the rival developments of complex analysis by Riemann and his 
successors on the one hand and Weierstrass and his supporters on the other. The 
lecture course included a chapter on minimal surfaces at this point, as an illustration 
of how new subjects in mathematics sell themselves by the unsuspected but 
powerful connections they make with other branches of the subject, but I have 
decided that that story is better placed in the planned history of differential 
equations. ' 

After a further pause for taking stock the book turns to topics in the real analysis 
of the later nineteenth century: uniform convergence and what lies beyond, what 


‘For a much fuller history of the study of minimal surfaces, see the forthcoming (Gray and 
Micallef). 
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were called ‘assumptionless functions’ and their strange properties, with the 
questions they raise about the nature of point sets and the properties of the integral, 
culminating in the collapse of the fundamental theorem of the calculus. Then comes 
the explicit construction of the real numbers and a look at the first proofs of the 
implicit function theorem (this replaces a chapter on the first rigorous methods in 
potential theory, which I have moved to the projected volume on the history of 
differential equations). The final three chapters before the last revision chapter look 
forward and deliberately look at topics that were not by any means resolved by 
1900: paradoxes to do with length and area and the behaviour of the integral; the 
Cantorian theory of infinite sets; and some topics where topology finally surfaces. 
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Chapter 1 
Lagrange and Foundations for the Calculus 


1.1 Introduction 


Mathematical analysis has many beginnings, and in the 19th century its history has 
a lot to do with the reformulations of the calculus that gradually led to the creation 
of adequate foundations and standards of rigorous argument. This does not mean 
that these were the only issues occupying the minds of mathematicians, or that they 
succeeded completely, but in important ways the subject was greatly transformed as 
the century progressed. 

This chapter and the next two are intended to set the scene, and to give a glimpse 
of what was happening between, roughly, 1780 and 1820. This covers the period 
of the French Revolution and the Napoleonic wars, and it is a largely French, even 
Parisian, story. In this chapter we look briefly at various 18th-century attempts to say 
why the calculus worked, concluding with what can be regarded as the last of the 
18th-century attempts. It is due to Joseph-Louis Lagrange, and its failure opened the 
way for the radically different accounts that followed.! 


1.2 The Calculus c. 1800 


The calculus was the independent creation of Isaac Newton in the 1660s and Gottfried 
Wilhelm Leibniz in the 1660s and 1670s. In the 18th Century it ground slowly to a 
halt in Britain, but it developed with remarkable vigour on Continental Europe, in 
the hands of the brothers Jakob and Johann Bernoulli, who were in direct contact 
with Leibniz, and with Johann Bernoulli’s one-time pupil, the remarkable Leonhard 
Euler. In the course of a long life spend first as a student in Basel and then divided 


‘For arich and stimulating account of rigour in 1 8th-century mathematics, and of what was involved 
in the transition to the 19th century, I particularly recommend Schubring (2005). 
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between the Academies of St. Petersburg, Berlin, and St. Petersburg again Euler 
wrote so much that the still-incomplete edition of his collected works has run to 78 
volumes.” 

The 18th century calculus dealt in both single and many variable problems, it 
embraced differential equations, first ordinary and then partial, it had a theory of 
differentiation and integration, it was full of clever techniques. In Euler’s hands 
particularly, it was a theory of functions: dependent variables were functions of 
independent variables. (The calculus of Newton and Leibniz had dealt in variables 
mutually varying.) Above all, the calculus became the core domain of mathematics, 
it illuminated geometry and it seemed to be the only way to the mathematical study 
of nature. 

At its core stood a painful paradox. The simple and invariably correct rules for 
differentiation and integration were established by arguments that invoked: the van- 
ishing of negligible quantities; arguments about infinitesimal quantities; plausible 
limit arguments that nonetheless seemed close to giving rules for evaluating 0/0. In 
short, the calculus worked—but no-one could adequately say why. 

This was not for the want of trying. Euler wrote at length on this, as on everything 
else, but his view was that the naive intuitions could be trusted if they were stated as 
clearly as they could be. However, neither he, nor anyone else, achieved that goal. For 
example, in his /nstitutiones calculi differentialis (1755), one of his major treatises 
that defined the calculus for the later 18th Century, Euler spelled out his opinion in 
this frequently quoted passage from his Jnstitutiones calculi differentialis (1755).° 


83. This doctrine of the infinite, however, will be better explained when we explain what 
the infinitely small of the mathematicians is. There is no doubt that any quantity can be 
diminished until it vanishes and is transformed, into nothing. But an infinitely small quantity 
is nothing else but a vanishing quantity and, therefore, actually will be = 0. This definition 
of the infinitely small is in agreement with the other which states that it is smaller than any 
given quantity. Indeed, if a quantity were so small that it is smaller than any given one, then 
it certainly could not be anything else but zero; for if it were not = 0, then a quantity equal 
to it could be shown, which is against the hypothesis. To those who ask what the infinitely 
small quantity in mathematics is, we answer that it is actually = 0. Hence there are not so 
many mysteries hidden in this concept as there are usually believed to be. These supposed 
mysteries have rendered the calculus of the infinitely small quite suspect to many people. 
Those doubts that remain we shall thoroughly remove in the following pages, where we shall 
explain this calculus. 


After noting that it may make sense to write 2 : 1 = 0: 0 because the first term 
is twice the second and the third time twice the fourth, Euler went on 


§85 ...It is known that ...1.0 = 0 as well asn : 1 = 0: 0. From this it seems possible that 
two quantities, whatever their geometric ratio may be, will always be equal if we look at them 
from the arithmetic point of view. Hence if two zeros can have an arbitrary ratio, then I judge 
that different signs should be applied, especially when we have to consider a geometric ratio 
of different zeros. The calculus of the infinitely small is therefore nothing but the investigation 
of the geometric ratio of different infinitely small quantities. This enterprise will be thrown 


?For a growing and invaluable guide, consult the Euler Archive on the web. 
3The translation is taken from Struik (1969, 384). 
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into the greatest confusion unless we use different signs to indicate these infinitely small 
quantities .... 


§86 Hence, if we introduce into the infinitesimal calculus a symbolism in which we denote 
dx an infinitely small quantity, then dx = 0 as well as adx = 0 (a an arbitrary finite quantity). 
Notwithstanding this, the geometric ratio adx : dx will be finite, namely a : 1, and this is 
the reason that these two infinitely small quantities dx and adx (though both = 0) cannot be 
confused with each other when their ratio is investigated. Similarly, when different infinitely 
small quantities dx and dy occur, their ratio is not fixed though each of them = 0. 


Whatever this may mean, it cannot be said to do more than gesture at what might 
be involved in rigorising the calculus, 

Euler’s only rival, for most of his life, was the French mathematician Jean le Rond 
d’Alembert, who concentrated his efforts on problems in applied mathematics and 
referred to the 18th century as the century of mechanics.* He owes his name to the 
fact that he had been abandoned as an infant on the steps of the church of St Jean 
le Rond in 1717, which is near Notre Dame in Paris, and he was brought up by 
the glazier’s wife, to whom he remained devoted. He had an excellent memory and 
grew up to be a brilliant conversationalist, which enabled him to become prominent 
in the competitive world of the Parisian salons that were frequented by influential 
people who were neither as rich and powerful as the nobility nor as conservative 
as the Church. In this milieu he rose to become a leading member of the group 
around Denis Diderot who produced the Encyclopédie (or, Dictionnaire raisonné des 
sciences, des arts, et des métiers, (1751-1772). This remarkable series of volumes 
was animated by the philosophy of rationalism, and is one of the major documents 
of the Enlightenment, and this led to heated conflict with the Jesuits and other forces 
of Catholic conservatism. In the ensuing struggles Diderot and his colleagues were 
supported by Voltaire and later by Condorcet, and eventually they succeeded in taking 
control of the Académie des Sciences. 

His interest in mechanics led d’ Alembert to take a Newtonian view of the founda- 
tions of the calculus. Newton had indicated the foundations of his calculus in these 
terms in a series of Lemmas on pages 38-39 of the Principia 


These Lemmas are premised to avoid the tediousness of deducing involved demonstrations ad 
absurdum, according to the method of the ancient geometers. For demonstrations are shorter 
by the method of indivisibles; but because the hypothesis of indivisibles seems somewhat 
harsh, and therefore that method is reckoned less geometrical, I chose rather to reduce the 
demonstrations of the following Propositions to the first and last sums and ratios of nascent 
and evanescent quantities, that is, to the limits of those sums and ratios, and so to premise, 
as short as I could, the demonstrations of those limits. 


These Lemmas have been much discussed. D.T. Whiteside, the editor of The 
Mathematical Papers of Isaac Newton, brushed them aside as not, after all, central to 
the arguments in the Principia, and called them “undeniably a retrospective gloss”.° 


Indeed, it is clear that Newton did not derive the results of the Principia by his calculus 


4On d’ Alembert’s life and times see Hankins (1970). A vigorous new edition of all of d’ Alembert’s 
papers is also underway. 


5See Whiteside, The Mathematical Papers of Isaac Newton, Vol. 6, p. 108. 
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and then cover his tracks with geometry, as was once suggested. However, Pourciau 
(1998, 281) argued forcefully that “each lemma is itself (a natural replacement for) 
an elementary and basic definition, property, or theorem of the calculus” [Pourciau’s 
italics]. The point to note in the context of 18th-century explorations of these issues 
is that an approach to the calculus in terms of some, not entirely precise, idea of 
limits was advocated by some authors and carried a Newtonian pedigree. 

D’ Alembert addressed directly the question of how to defend a limiting argument 
that appears to make sense of an expression of the form 0/0.° Unfortunately his 
answer is confused at crucial points—a familiar feature of his written work, which 
lacked the elegance of his conversations. 

He first suggested that 0/0 can have any value, including the correct one, and then 
argued that the limit is not the ratio 0/0 because this ratio is not defined. This may 
be the source of another problem with his concept of a limit, which he defined this 
way. 

One magnitude is said to be the limit of another magnitude when the second may approach 

the first within any given magnitude, however small, though the first may never exceed the 


magnitude it approaches; so that the difference of such a quantity to its limit is absolutely 
unassignable. 


It seems that, for d’Alembert, a limit may be approached, but only from one side, 
and never attained. 


1.3 Joseph Louis Lagrange 


Joseph Louis Lagrange (1736-1813) 


®See ‘Limite’, Encyclopédie, vol. 9, 1765. There is an extract from d’ Alembert’s essay on differ- 
entials, Encyclopédie, vol. 4, (1754) in Struik (1969, 342-345) and also in Fauvel and Gray (1987, 
18.A3). 
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There matters rested until Joseph Louis Lagrange took up the question. Lagrange 
was born in Turin in 1736. He was not an Italian, for a unified Italy was over a century 
away, but rather someone who was in some ways Italian and in others French. From 
his youth onwards he was devoted to the work of Euler, his earliest interests were in 
the calculus of variations and in mechanics, notably planetary astronomy, branches of 
the calculus in which Euler was the master. The younger man then followed him into 
researches in algebra and number theory. In all of these areas he was eventually to 
surpass Euler, for where Euler was a brilliant “experimenter” if you like, a discoverer 
of remarkable facts, Lagrange was the one who came up with the proofs; proofs 
that elude Euler are of a high calibre, and Euler was impressed. Yet Lagrange was 
a very shy man, as diffident as he was gifted; Clairaut described him in 1764 as “a 
young man as remarkable for his talents as for his modesty; his character is mild and 
melancholy; he knows no other pleasure than to study”.’ Remarkably, although Euler 
and Lagrange corresponded frequently, they never met. Indeed, Lagrange would 
only go to the Berlin Academy of Sciences after Euler had left in 1766 to resume 
directorship of the St Petersburg Academy; it seems Lagrange believed Euler to 
be too much his superior—even though Lagrange was succeeding Euler as director 
in Berlin. He stayed in Berlin until he moved to Paris in 1787, and over the next 
twenty years—the years of the Revolution and of Napoleon—Lagrange contributed 
to establishing Paris as the new world centre for mathematics in France. 


Higher Education in the French Revolution 


By the mid-1790s the French educational system had totally collapsed, and out of 
the chaos came two vigorous new institutions. One, founded in 1794, was the Ecole 
Normale de |’ An I1J]—that is, Normal School of the Year Three (of the Republic, the 
revolutionaries redefined the calendar and in that way sought to redefine time itself) 
where teachers were to be trained. Potential student-teachers from throughout France 
were assembled in Paris and lectured to, in an endeavour to establish educational 
norms for the whole country, by the leading mathematicians of France: Lagrange, 
Pierre-Simon Laplace and Gaspard Monge. In this form the school lasted only a few 
months (it was later refounded as the Ecole Normale Supérieure, originally intended 
to provide training for teachers), but its influence lasted longer, because the lectures 
were copied down (by stenographers in the audience) and printed. The educational 
reformers were clear that the educational system for all France should be determined 
and influenced by decisions and practices in Paris, and further, that there should be 
a strong mathematical component in the curriculum at all levels. 

The Ecole Polytechnique was also founded in 1794, with the task of training 
large numbers of students in physics, mathematics and engineering.® It was the 
first institution in the world of this kind, and every year students competed across 
the whole of France to become one of the two hundred or so entrants. Monge and 


7See the letter to D. Bernoulli, quoted in Euler, Opera Omnia, IV, AS, p. 330, n. 2. 


8It was originally called the Ecole Centrale des Travaux Publics (the Central School for Public 
Works) thus for civil and military engineering, and given its present name the in the following year. 
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Lagrange taught there, and Pierre Simon Laplace held the even more influential 
position of examiner—he determined what would be examined as well as what level 
of attainment the students ought to display. The founding of the Ecole Polytechnique 
is a significant event in the rise of the modern university. 

In 1799, the Ecole Polytechnique was re-organised. It now provided two years of 
courses that were preparatory to the specialised training that graduates of the school 
could go on to obtain in one of the higher Ecoles (such as the School of Mining, the 
School of Artillery, the School of Military Engineering, and the School of Bridges 
and Roads). In 1804 Napoleon ordered that the Ecole Polytechnique be militarised, 
and the students treated like cadet soldiers. It has been suggested that a factor in 
his wish for a more disciplined regime at the school was the students’ reluctance to 
welcome Napoleon’s appointing himself Emperor. 


1.4 Lagrange on the Foundations of the Calculus 


To return to the story of the calculus, in 1784 Lagrange, then still at the Berlin 
Academy, proposed that the Academy award a prize for 


aclear and precise theory of what is called Infinity in mathematics [because] It is well known 
that higher mathematics continually uses infinitely large and infinitely small quantities. 
Nevertheless, geometers, and even the ancient analysts, have carefully avoided everything 
which approaches the infinite. 


The matter was to be “treated with all possible rigour, clarity, and simplicity”, so it 
is clear that the judges were not asking for a few well-phrased definitions but for a 
new and substantial effort to vindicate the calculus. 

In this they were unsuccessful.? No major mathematician entered, and the judges 
reported that the entries lacked rigour, they failed to deal with the problem of deduc- 
tion from contradictory assumptions (for example, that a quantity is both zero and 
non-zero), and they did not explain “how so many true theorems have been deduced”. 

So when Lagrange began teaching analysis at the Ecole Polytechnique in 1797 
the pressure was on him to meet the challenge he had earlier set, and he published 
his Théorie des fonctions analytiques. All his mathematics has an algebraic cast, and 
the full title of this work makes clear that this is no exception. It runs 


Théorie des fonctions analytiques contenant les principes du calcul différentiel, dégagé de 
toute considération d’infiniment petits ou d’évanouissans, de limites ou de fluxions, et réduits 
a l’analyse algébrique des quantités finis. 


That is: The theory of analytic functions containing the principles of the differential 
calculus disengaged from any consideration of the infinitely small or vanishing quan- 
tities, limits and fluxions, and reduced to the algebraic analysis of finite quantities. 


°The prize was awarded to Simon Antoine Jean L’Huillier for his Exposition élémentaire (1787). 
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He argued that the basic principles of the calculus had been neglected by earlier 
mathematicians who had been concerned only with obtaining solutions to problems, 
and had therefore entertained unrigorous concepts such as infinitesimals. However 
unfair—the method of infinitesimals proved to contain more of the kernel of the even- 
tual solution than Lagrange’s own methods—his criticisms were valid. He rejected 
Newton’s methods on two grounds: they depended the “foreign idea” of motion 
that belonged to physics, not mathematics; and there was no adequate definition of 
‘instantaneous variable velocity’. He rejected d’Alembert’s limit concept as being 
too vaguely geometric, and geometry, like motion, was “foreign” to the very spirit 
of analysis. And once he had rejected all previous attempts, he set about placing all 
previous results in the calculus within a new, rigorous framework. 

Lagrange showed, to his satisfaction, that every function f can almost always be 
expanded as a power series of the form 


fati=fa)+taitai*+---, 


where i stands for an arbitrary increment, provided that one could somehow derive 
the coefficient of the first power of i from f itself.'° He made a strong case that the 
meaning of f’(x) does not have to depend upon limits, infinitesimals, or prime and 
ultimate ratios: itis simply the coefficient of 7 in the series expansion of f (x +7). His 
line of reasoning is thoroughly algebraic, and such a claim, if valid, would go a long 
way towards making the calculus rigorous. However, his ideas were not completely 
convincing and, as we shall see in the next few chapters, they were soon replaced. 
But the force of his example, and his eminence as a mathematician, generated much 
debate over the foundations of the calculus: those mathematicians were not satisfied 
with merely obtaining results would now find it necessary to justify their methods 
much more rigorously. 

One might object that Lagrange’s enterprise is hopeless, and point out that the 
function defined by 


cannot be differentiable at the origin. But on such occasions Lagrange was known 
to observe that!! “This principle is generally valid, but I have remarked that it can 
be subject to exceptions that can make the preceding demonstration defective.” One 
interpretation of this remark might be that a claim with a convincing proof that was 
surely true in general but occasionally false might be viewed as true in the way 
that we today say that “Everyone can vote in this election” and mean something 
more like “Everyone can vote in this election except those barred by specific, well- 
known, pieces of legislation”. We shall meet this viewpoint that theorems may admit 
exceptions in a more sophisticated context below in Sect. 4.4. 
In a little more detail, Lagrange argued as follows. 


!0See the extract in Fauvel and Gray (1987, 18.A4) and below. 
‘Tn Lagrange (1826, 182) quoted in Sgrensen (2005, 470). 


8 1 Lagrange and Foundations for the Calculus 


1. In the expansion of f(x +7), where i is a small increment, no fractional powers 
of i can occur except for particular values of x. 

2. So f(x +i) = f(x) + f’(@x)i + f’(x)i? +--+; this expression defines 
POs @ysx 

3. In the expansion f(x +i) = f(x) + pit+qi?+ri>+--- replace x by x +0 
where o is independent of x, and by expanding f(x +7 + 0) in two ways (first, 
replace i by i + 0 in the above expansion, second replace x by x + 0) deduce that 
the process which determines p from f, say p = f’ also determines g from p 
by 2q = p’,r from q by 3r = q’, and so on. 

4. By considering the well-known cases from the elementary calculus, deduce that 
fiw= g and hence the familiar formula for the Taylor series. 


The best account of Lagrange’s approach to the foundations of the calculus, and 
the only one to respect its depth and novelty and so grasp its ultimate failure without 
anachronism is Ferraro and Panza (2012). They argue (p. 99) that 


numbers and geometric and mechanical magnitudes are, for Lagrange, quantities of a par- 
ticular sort, whereas his theory of analytical functions is intended to deal with quantities in 
general, or better in abstracto. 


So Lagrange took his algebra to apply to entirely abstract quantities, which he 
often called ‘algebraic quantities’, and (p. 100) 


Lagrange’s theory is algebraic insofar as it deals with algebraic quantities, [and] formal, 
insofar as these quantities are identified through the relations they have with each other, 
which are in turn displayed by appropriate formulas. 


But this saddled Lagrange with a problem, which was why his ‘algebraic’ quan- 
tities obey the usual rules for quantities, such as being ordered and varying con- 
tinuously. He did not want to reduce his general quantities to the special case of 
geometrical ones, for that would prevent him explaining why the calculus could be 
applied to geometry, and Ferraro and Panza argue that this is where and how he 
failed. Their conclusion is (p. 100) 


Our basic point in explaining Lagrange’s failure is precisely that his notion of algebraic 
quantity does not guarantee that algebraic quantities meet this crucial requirement, so much 
so that he cannot but surreptitiously suppose that they do meet it. Hence, this notion appeared 
too weak to bear the weight of Lagrange’s reductionist purpose. But once reinforced with 
this surreptitious assumption, it became too strong to play the role of a starting point of his 
foundational program. 


1.5 The Infinitely Small and the Infinitely Large 


It will become clear as this book unfolds that a number of mathematicians, Cauchy 
among them, were able to make great progress in developing and rigorising the 
calculus without always being clear what they were talking about. This is not the 
paradox it may seem; they were dealing with exceptionally difficult topics where 
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being productive is the best that can be hoped for. The historians’ job, on such 
occasions, is to make sense of the texts they wrote on their own terms, giving good 
reasons for their sense of what the phrase “on their own terms” means. 

To do this, careful reading of each specific text is required. Other texts may be 
consulted: those by the same author, those by contemporaries and influential prede- 
cessors. Definitions, when they were given, can be usefully compared with the use of 
the terms in practice, and may not agree. Meanings may be elucidated by establishing 
a set of ideas that the original author shared with his or her contemporaries. When the 
original text had an influence, the historian can gain a feeling for what the text may 
say by finding out how it was read by the author’s contemporaries and successors. 
If the author’s ideas were taken up, then on what grounds? How were they altered 
in subsequent use? If they were not taken up, then why not? Were they neglected, or 
explicitly rejected? Were they understood or misunderstood? 

Specifically, there was a profusion of methods in the early calculus that condensed 
in the hands of Newton and Leibniz to talk of limits and infinitesimals. In keeping 
with the procedures just outlined, these concepts are often glossed by historians of 
mathematics as something like “very very small” or “arbitrarily small” or “tending 
to zero”, in a loose chain of influences from Newton and Leibniz via the major 
names to Weierstrass and the language of standard analysis (¢ — 6 — N analysis). 
Most historians of mathematics give old authors the benefit of the doubt, while 
noting that more needs to be said by modern standards. It is indeed tiresome and 
ultimately ahistorical to read an old text as if marking an undergraduate essay. But 
interpretation can be difficult when subtleties may have led mathematicians into 
foundational problems, or even, as we shall see with Cauchy in Chap. 4, into error. 

For example, Leibniz introduced his differential calculus in 1684in terms of an 
axis AX, a curve VV, the perpendicular (or ordinate) VX to the axis, and a tangent VB 
intersecting the axis at B. Then he wrote!” 


Now some straight line selected arbitrarily is called dx, and the line which is to d as v is to 
XB is called dv, or the difference of these v. Under these assumptions we have the following 
rules of the calculus. 


If a is a given constant, then da = 0, and d(ax) = adx. If y = v (that is, if the ordinate of any 
curve YY is equal to any corresponding ordinate of the curve VV), then dy = dv. Now addition 
and subtraction: if z-— y+w+x =v, thend(z-—y+w+x) =dv=dz—dy+dw+dx. 
Multiplication: d(xv) = xdv + vdx, or, setting y = xv, dy = xdv+vdx. 


No matter that the rules are not explained—what is this difference dv? It would seem 
to be some, presumably small, segment of a straight line. But less than a page later, 
Leibniz wrote: 


We have only to keep in mind that to find a tangent means to draw a line that connects two 
points of the curve at an infinitely small distance, or the continued side of a polygon with 
an infinite number of angles, which for us takes the place of the curve. This infinitely small 
distance can always be expressed by a known differential like dv, or by a relation to it, that 
is, by some known tangent. 


2 See Struik Sourcebook, 272-280, F&G 13.A3. I have removed some repetitions from the text. 
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Now it is presented as an infinitely small distance. Could it be that Leibniz did indeed 
think of there being infinitely small distances, or was that more a way of speaking, 
a useful fiction? It is already clear that they have contradictory properties, and why 
should d(xv) not be written as (x + dx)(v+ dv) — xv = xdv + vdx + dxdv? 

In 1701 Leibniz published an article in which he said that the infinite need not 
be taken rigorously, “For in place of the infinite or the infinitely small we can take 
quantities as great or as small as necessary in order that the error be less than any given 
error.” Several people who had learned the calculus from him asked for a clarification, 
and he wrote a letter in 1702 saying that although infinitely small magnitudes cannot 
be real nonetheless geometry proceeds as if they are. However, he went on, actual 
infinite and infinitesimal quantities are only fictions that can be used for the sake of 
brevity.!> To quote a recent authority (Arthur 2013, 553-554): 


...the idea that Leibniz was committed to infinitesimals as actually infinitely small enti- 
ties is a misreading: his mature interpretation of the calculus was fully in accord with the 
Archimedean Axiom. Leibniz’s interpretation is (to use the medieval term) syncategore- 
matic: Infinitesimals are fictions in the sense that the terms designating them can be treated 
as if they refer to entities incomparably smaller than finite quantities, but really stand for 
variable finite quantities that can be taken as small as desired. 


Euler’s use of the concepts of infinitely large and infinitely small numbers also 
indicates the issues. The exponential function a*, a > 1 was presented in his Intro- 
ductio in analysin infinitorum §98 as a function that, among other properties, goes to 
infinity as z goes to infinity. Then, when explaining how to write the logarithm and 
exponential functions as power series, he considered what is meant by the expression 
a”’,a > 1, and wrote (§114): 


Let w be an infinitely small number, or a fraction so small that, although not equal to zero, 
still a’ = 1 + ~, where 7 is also an infinitely small number .... 


From this he deduced that one can write a” = 1 + kw. 
A few pages later (in §134) when explaining how to write the sine and cosine 
functions as power series, he wrote 


Let the arc z be infinitely small, then sinz = z and cosz = 1. If n is an infinitely large 
number, so that nz is a finite number, say nz = v, then .... 


How might these and many other remarks have been understood by him or his 
readers? The use of infinites and infinitesimals was not explained by him in the 
Introductio, so we must start by taking his words at face value. For example, there 
are infinitely small and infinitely large numbers, and they are not, apparently, the 
result of a limiting argument. This does not mean that Euler fully understood what 
has to be said to make the concept consistent with other concepts he used. It does not 
mean that his use of other terms, such as infinite series, was in any way different from 
the usual spread of ideas from Newton to Cauchy and Weierstrass. In the Introductio, 
Euler wrote an infinite series in the usual way, e.g. as A + Bz + C2 4+ D34+---, 


'3See Jesseph (2015, 194-195); the volume contains a number of important articles on Leibniz’s 
mathematics and philosophy. 
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and indeed insisted (§62) that “There is a law by which any coefficient is determined 
by a certain number of its predecessors”’. 

The same is true of Euler’s use of the concept of the infinitesimal, as the passage 
from his Jnstitutiones calculi differentialis showed. Taken at face value, Euler said 
that an infinitely small quantity is really equal to zero. Moreover, his handling of 
terms like dx in his discussion of foundations is open to obvious misuse; we can 
note that he does not explain how to avoid obvious contradictions, and he even, 
on occasion and presumably for pedagogic purposes, regarded one millionth as the 
“smallest possible amount” (Introductio §148). 

The most obvious interpretation of all this is that Euler was no more able to explain 
the well-known problems with the calculus than anyone else, but was comfortable 
with naive arguments that allowed him to get on with the job: infinite numbers are 
very, very big, bigger than any finite number; infinitesimals are very, very small, 
even vanishing. Implicit limiting arguments could be made explicit, but arguments 
with infinites and infinitesimals are quicker when you have learned how to use them 
wisely. Euler is intelligible, and of his time, when read in that way. 

Recently there have been attempts to argue that Leibniz, Euler, and even Cauchy 
could have been thinking in some informal version of rigorous modern non-standard 
analysis, in which infinite and infinitesimal quantities do exist. However, a historical 
interpretation such as the one sketched above that aims to understand Leibniz on 
his own terms, and that confers upon him both insight and consistency, has a lot to 
recommend it over an interpretation that has only been possible to defend in the last 
few decades. It is parsimonious and requires no expert defence for which modern 
concepts seem essential and therefore create more problems than they solve (e.g. 
with infinite series). The same can be said of non-standard readings of Euler; for a 
detailed discussion of Euler’s ideas in this connection, see Schubring (2005). This 
is not to say that Leibniz, Euler, or even Cauchy in certain ways, almost had the 
modern, classical, Weierstrassian kind of real analysis in mind. Plainly they did not. 
But it is to say that the foundations of the calculus were for at least two centuries the 
subject of shifting, partial, and largely coherent speculations that form the opening 
chapters of the history of analysis. 


Chapter 2 
Joseph Fourier 


Joseph Fourier (1768-1830) 


2.1 Introduction 


Fourier series are trigonometric series used to represent a function, and they are 
widely used throughout pure and applied mathematics. Fourier was not the first to 
use them, but his name is rightly attached to them because he was the first to use 
them in the study of heat diffusion, to display their use in the solution of a partial 
differential equation, and to argue successfully for their generality. 

There had been a long 18th-century debate about trigonometric series in con- 
nection with solutions to the wave equation and the shape of a vibrating string. On 
the one hand it seemed reasonable that a string could have any continuous initial 
shape—that was Euler’s view—on the other hand the equation could only be solved 
by functions to which the calculus applied (we would say that the solutions had to 
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be twice differentiable) and this made them what d’ Alembert called analytic. Con- 
vergence questions were not central to this debate, which was left unresolved in a 
number of ways. 

Fourier proposed to reopen the debate by boldly asserting that any solution to the 
heat equation, which he was the first to derive, could be written as an infinite sum of 
sines and cosines for the simple reason that any function could be written that way. 
This is a dramatic claim, and it was still more so in his day, because the consensus 
was that however broadly a function might be defined all the functions that arise in 
practice are finite sums of familiar ones: polynomials, sines, cosines, exponentials 
and logarithms, nth roots, and the like. They could also be infinite power series, 
and indeed infinite trigonometric series, but nonetheless they had the usual sorts of 
properties, such as smoothly varying graphs. No-one said so in so many words, but 
it is clear that the expectation was there, and Fourier in particular simply assumed 
that every function is continuous, as is clear from his account of the coefficients of 
a Fourier series in his (1822, §423). 

One of the dramas introduced by Fourier’s series was that they readily flout all 
these expectations. As we shall see, at various stages in the 19th century they provided 
fresh, and disturbing, examples of just what functions could do. Contrary to what 
Fourier himself believed, if Cauchy’s work began the exploration of what rigorous 
mathematics can do, Fourier series can indicate just what theory is up against. 


2.2 Fourier’s Career 


Joseph Fourier was born in Auxerre, France in 1768. He was orphaned at the age 
of 9 and placed in the town’s military school where he learned mathematics and a 
sense of civic responsibility. He was nearly guillotined at the height of the Terror 
in 1794, but the sentence was withdrawn and Fourier was able to go to the Ecole 
Normale. In 1795 he was appointed an assistant lecturer at the Ecole Polytechnique, 
working under Lagrange and Monge, and in 1798 Monge, a prominent supporter of 
Napoleon, selected Fourier to go on the French expedition to Egypt. After the British 
defeated them there, Fourier returned to France in 1801, hoping to resume his work 
at the Ecole Polytechnique, but Napoleon had been impressed by his organisational 
talents and sent him instead to be the prefect of Governor of the Department of 
Isére.! He was so successful here that Napoleon made him a Baron in 1808, and in 
1809 he finished his contribution to the Description d’Egypt, a massive account and 
glorification of ancient Egypt based on the surveys that French engineers had made 
of Egyptian pyramids and other remains.” 


' An administrative region of France that extended from Grenoble to the French border. 

>This period is the start of the celebration of ancient Egypt in the modern world, from Cleopatra’s 
and other needles to fanciful statements about ancient wisdom, secret knowledge, and so forth, none 
of which can be held against Fourier. See Buchwald and Feingold (2012). 
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The eventual defeat of Napoleon was the lowest point of Fourier’s life, but in 1816 
he obtained a position as Director of the Bureau of Statistics for the Department of 
the Seine, a position which left him good time for research. His political enemies 
now in power delayed his appointment to the reformed Academy of Sciences for a 
year but he eventually rose to become the permanent secretary of the Academy in 
1822 and to be elected to the Académie Frangaise in 1827. He died in 1830 as the 
result of complications from an illness caught in Egypt. 


2.3 Fourier and Series of Sines and Cosines 


The book Théorie analytique de la chaleur, in which Fourier presented his ideas, 
was written work in several stages. He submitted a version to the Paris Academy 
of Sciences in 1807, but although Laplace, Lacroix and Monge were in favour of 
publishing it, Lagrange blocked publication, apparently because its treatment of 
trigonometric series differed markedly from the way he, Lagrange, had stipulated in 
the 1750s. Another chance came in 1810, when the Academy of Sciences announced a 
prize competition on heat diffusion. Fourier submitted a revised memoir, which won, 
but was criticised for a lack of rigour and generality. Fourier thought the criticism 
unfair, but revised it again, and the resulting book came out in 1822 (after Lagrange’s 
death and when Fourier’s standing was rising in the Academy). 


Heat Diffusion 


Fourier was interested in finding the temperature at every point of a solid body, 
perhaps as a function of time, when the shape of the body, its physical properties, 
and the temperature on some or all of its boundary is given. He made no assumptions 
about the nature of heat and concentrated on how it flowed. 

He considered that any solid body could be regarded as made of infinitesimal 
cubes, and argued, on the basis of some observational evidence, that the amount of 
heat that passes from the hotter part of the body to an adjacent colder part in an instant 
of time is proportional to the duration of the instant, the infinitesimal temperature 
difference between opposite faces of the cubes and a certain function of the distance 
between the particles that depends on the nature of the body. So each body determines 
some constants that characterise how heat flows in them, such as its conductivity and 
its specific heat. In what follows all these physical matters will be consumed in the 
single letter K. 

He considered what happens as the heat flows through one of these infinitesimal 
cubes, where temperature v is a function of x, y, z and f, the time. What enters the 
face with sides dx and dy is Kdxdy 2 evaluated at that face.> What leaves the opposite 
face is Kdxdy% evaluated at that face. This amount Fourier evaluated by saying that 
the faces are a distance dz apart, so he replaced z by z + dz, and the difference in the 
amount of heat between what enters and what leaves is given by 


3] have modernised Fourier’s ‘d’ notation by writing Ov where he wrote dv. 
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Ov ay 
Kdydxd | — } = Kdxdydz—. 
. (5) epg 


Should the temperature be in a steady state the sum of these quantities taken over 
the three pairs of opposite faces of a cube is zero and the resulting equation is 

ay Bs Oy fe ay 

Ox? Oy? Az? 


= 0. (2.1) 


The more important situation is when the temperature is changing. Fourier now 
argued that the amount of heat leaving a cube in the z direction is once again 


Kdxdydz&3, but now the sum over the pairs of opposite faces equals the rate of 
change of temperature, which is given by e The result (see Théorie §128, p. 102) 


is the heat equation: 


(2.2) 


Ye Ov, Oy | Ov 
Ot Ox2 Oy? az? PE 


In each case, solutions of this partial differential equation are required that satisfy 
the given boundary conditions, and Fourier confined his attention to bodies with 
simple shapes, such as a cuboid, or one equivalent to this by a suitable coordinate 
transformation. 

For example, in ($166, p. 133) Fourier considered a semi-infinite strip of a given 
width, 7 in suitable units. He supposed that the temperature at the base is kept constant 
at lin some units and that the temperature of the infinite sides is kept at at 0, and 
looked for the corresponding steady state distribution of temperature. Let y measure 
the height above the base and x the horizontal distance of a point from the mid-line 
of the strip (I have relabelled his coordinates) so the differential equation, which now 
involves only two variables, is 


Ov &yv 
«(52+ 5) <2 i 


Fourier looked for a solution of the form 
v(x, y) = fx)g(), 


which leads to the equation g”(y)/g(y) = —f”(«)/f (x) in which both sides must 
be constant, say m, so the solutions are of the form 


f(x) = cos(mx), g(y)=e (2.4) 


The temperature in the bar surely does not become infinite, so the exponential term 
must decrease and so m must be positive. Also, m must be odd so that the solution 
vanishes for x = +7 for all y, as required. 
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It was clear to Fourier that a sum of solutions of this form is also a solution and 
he proceeded at once to consider infinite sums, solutions of the form, as he wrote 
($169, p. 135), 


ae~’ cos x + be~>” cos 3x + ce” cos 5x + de~!” cos 7x + etc. (2.5) 
subject to the boundary condition at the base that 
1 =acosx + bcos3x + ccos5x + dcos7x + etc. (2.6) 


The arbitrary constants had now to be determined. Fourier first gave a marvellous 
argument that involved him in solving the infinitely many equations he could obtain 
for his infinitely many unknowns by differentiating equation (2.6) arbitrarily often 
(see §$171-176 of the Théorie). Only then did he give the simpler and more general 
way that has become standard, and start to claim (§220, see §A.1) that every function 
can be written as one of these series. By this he meant that every function is equal to its 
corresponding series, and that there is a simple rule for writing down the coefficients 
of the series. 

He noted (§221) that the integral 


7 


= 1 1 1 
sin jx sinkxdx = - sin(k — j)x - sin(k + px) 
i : 2 € =j VO ei No 


(2.7) 


vanishes when j # k and is 7/2 when k = j, and claimed that the coefficients of a 
series such as his can be found by integrating the product of the series with sin jx for 
each value of 7 > 0. Similar results apply to series of cosines, to series of sines and 
cosines, and to series obtained when the period is different (as it might be, 27 or 1). 

He went on to claim that any function f defined on the interval [—7, 7] can be 
written as an infinite series of sines and cosines in any of these forms (called the 
mixed series, the cosine series and the sine series, respectively): 


1 foe) 
f@)= 500 + San cos nx + by, sinnx. 


n=1 


1 foe) 
fx) = 50 + San cos nx. 


n=1 


1 CO 
f(x) = 540 + > bi sin nx. 


n=1 


The coefficients of the mixed series are given by the formulae aj = t 1 Sf (x)dx 


and 
TT Tv 


1 1 
ay = = f(x) coskxdx, by = — f (x) sin kxdx. 
NT Jor Te a9 
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Examples of these series had already been studied by Euler and Daniel Bernoulli. 
He then gave several simple examples of his series, some of which such a function 
constant on a given interval, or equal to x on a given interval. He showed how to 
obtain the function cos x as an infinite series of sines, dealt with cosine series as well 
as sine series, and 
Fourier was very proud of his series for the function F(x) = 7/4: 


1 1 1 
cos(x) — 3 cos(3x) + 5 cos(5x) — 7 cos(7x) +--+, 


as well he might be, once you see what it looks like. Here are three graphs of it: 
Fig. 2.1 shows the sum of only the first 5 terms in the series, Fig. 2.2 the first 25, and 
Fig. 2.3 the first 105. 

Note that the difference between the sum of the series and the sum of its first 105 
terms is certainly less than 1/209 and generally much less. 

It is clear that the infinite series represents a function that is +7/4 on the range 
(—1/2, 7/2) and that is —7/4 on the range (+77/2, 37/2). Indeed, it represents a 
function that is +7/4 on the range ((4n — 1)7/2, (4n + 1)7/2), that is —7/4 on the 
range ((4n + 1), (4n + 3)7/2), and that is zero at the points x = (4n + 1)7/2. 


Fig. 2.1 The first 5 terms of 1 
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Fig. 2.3. The first 105 terms 1 
of a Fourier series 
a b——+ 

Yos 

4 -2 0 2 4 6 8 

x 

-0.5 

——— | b 
=1 


Note that it is plainly not the case that a series of analytic (and therefore in partic- 
ular continuous) functions is itself continuous. We shall see that Fourier’s confident 
remark ($235) that all these series converge everywhere to the function that they 
represent, and his proof of the claim in §423 (see Appendix A.1) that the coefficients 
in a ‘Fourier’ series can be evaluated as he indicated, were to be the occasion for 
much significant later work. 


Chapter 3 
Legendre and Elliptic Integrals 


3.1 Introduction 


As their name suggests, by 1800 elliptic integrals had an established place in contem- 
porary astronomy. Kepler’s 2nd law implies that a planet in an elliptical orbit sweeps 
out equal areas in equal times, so mathematicians had long grappled with questions 
involving the rectification of an ellipse, which is given by an elliptic integral. The 
topic acquired a new urgency in the early 19th century with the discovery of the 
asteroids. Adrien-Marie Legendre’s computation of the values of elliptical integrals 
therefore fitted very comfortably into contemporary astronomy. 

However, elliptic integrals had long perplexed mathematicians because they 
seemed impossible to evaluate. In the early 18th century an Italian mathemati- 
cian, Count Giulio Fagnano, showed that there were similarities between elliptic 
and trigonometric integrals, and when his discoveries were reprinted in 1750 they 
came to the attention of Euler, who drew further results along the same lines. 

Legendre’s contribution was to create a theory of elliptic integrals. He showed 
that they could be reduced to canonical forms, in which they depend on a single 
parameter, that they were intimately connected to two linear ordinary differential 
equations, and that they were involved in several important problems in mechanics. 
Equally remarkably, he calculated tables of values of these integrals that made them 
as useful as any other functions in the mathematics of the day. These tables continued 
to be used until the advent of modern personal computers, and it has been estimated 
that producing them on his own took Legendre three years.! 


‘Dominique Tournes, private communication. 
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3.2 Legendre’s Career 


Adrien-Marie Legendre was born into a well-to-do family in Paris in 1752 and was 
able to live on private means and devote himself entirely to research, until the French 
Revolution wiped out his savings. He was by then well-known as a mathematician, 
and he briefly became one of the three commissioners involved in the determination 
of the standard meter. He then wrote an introduction to Euclidean geometry, his 
Eléments de géométrie, which ran to many editions and became the work every 
aspiring applicant to the Ecole Polytechnique had to read, thus solving his financial 
problems.” After this he became involved in implementing the decimal system for 
angles (a reform that did not last) and peripherally involved in the Ecole Polytech- 
nique, examining students applying to the artillery school. In 1813 he succeeded 
Lagrange, who had just died, at the Bureau des Longitudes, where he remained until 
his death in 1833 at the age of 80. 

His scientific work was concentrated on a number of Eulerian themes: number 
theory, and a study of the new functions Euler had introduced into analysis. It brought 
him into painful collision with Carl Friedrich Gauss on a number of occasions. 
Legendre was the first to publish the method of least squares in statistics, but Gauss 
claimed, correctly that he had been the first to use it. Gauss also found fault with 
Legendre’s attempts to establish the parallel postulate as a theorem in Euclidean 
geometry in his Eléments de géométrie; the attempts were indeed flawed. However, 
his main contribution to mathematics is probably his elaborate theory of elliptic 
integrals, even though, as we shall see and as Legendre was please to acknowledge, 
it was to be eclipsed before the end of his life. 


3.3 Elliptic Integrals 


The paradigm elliptic integral is 


i dt 
u= f GB.1) 
_ 74 
: V1—t 


It measures arc-length along the lemniscate r? = cos 26, which is a curve in the 
shape of a figure eight. This is easy to prove, for differentiating the equation for the 
lemniscate gives 

rdr = —sin (20) dé. 


2See Dhombres (1985) for a discussion of mathematical publications in France after the revolution. 
He gives these sales figures for 1812: 3000 copies of the ninth editions of textbooks on geometry 
by Lacroix and Legendre, and 2000 copies of Monge’s work on descriptive geometry. In 1813 he 
estimated that 1500 copies of Lagrange’s book on the theory of functions and also of Lacroix’s 
textbook on the calculus were published. 
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The element of arc length in polar coordinates is given by 
ds’ =dr* +r°d0’, 


and the claim follows on eliminating @ and obtaining this expression for arc-length: 


The total arc length of the lemniscate is denoted 2w. 

The integral (3.1) can be regarded either as defining u as a function of v, or v as 
a function of u. In either case, one is led to look for analogues of the behaviour of 
the trigonometric functions, because the integral 


Uv 
/ dt 
u= | — 
d V1 — 1? 
may be taken to define the function v = sin uw. 


Fagnano and Euler 


In 1750 Count Fagnano, an Italian mathematician, submitted a book containing his 
life’s work to the Berlin Academy.* The Produzioni matematiche, as it was called, 
contained a remarkable analogy with the trigonometric integrals, that the Count had 
discovered in 1714. 

Arc-length along the lemniscate is a transcendental function of the parameter, r, 
but Fagnano showed that the values of the parameter for an arc and an arc of twice 
the length are algebraically related. More precisely, if 


VU W 
i dt =2f dt 
J JI-f# J V1—t4 


then v and w are algebraically related (we shall see this more generally below). More- 
over, he showed that the values of t that divide the lemniscatic arc into 2.2”, 3.2’, 
or 5.2” pieces are also algebraically related. 

The book was naturally sent to Euler, who had been interested in elliptic integrals 
for many years, and he immediately generalized Fagnano’s results.* He regarded the 
integrals as solutions to the differential equation 


dx dy 


= 
V1 — x4 Ji=y* 


(3.2) 


3See Enneper (1890, Note II). 


4See his paper E252 of 1752 and his later, more extensive treatment E251 (1760) that we follow 
here. There is an English translation of E251 by Stacy Langton on the Euler Archive. 
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and using the analogy with the trigonometric case 


dx dy 


=+ 
J 1 — x? Vl—-y? 


Euler guessed correctly and then verified (E251, §9) that the solution of the differ- 
ential equation (3.2) is 


Ly peg y = c? + 2xy(1 ey (3.3) 


where c is an arbitrary constant. This yields what is now called Euler’s algebraic 
addition theorem for elliptic integrals: 


x y c 
dt dt dt 

+ = | — (3.4) 
J v1—t4 s v1—t J Vv1—t4 
which says that the sum of any two arcs of a lemniscate is an arc whose parameter, c, 
depends only algebraically on the parameter values of the original arcs. Equation (3.4) 
and Euler’s equation (3.3) confirm what Fagnano had already noticed, that doubling 
the lemniscatic arc yields a quartic equation, whereas doubling of a circular arc yields 
only a quadratic equation. 


At this point the analogy seems to have deserted Euler, and he did not proceed to 
create a theory of ‘lemniscatic integrals’ 


Vv 
is i dt 
u=u(v)= | ——. 
J v1—t4 
Krazer, in the preface to Euler’s Opera Omnia (1), vol. 20, p. 21, argued that on this 
occasion Euler remained too close to the geometry of conics, and before Legendre, 


as Enneper pointed out (Enneper 1890, 542), “the writings of Fagnano seem to have 
been very little known, and the writings of the great Euler too little read”.° 


Legendre’s Elliptic Integrals 


Legendre was the first to study the lemniscatic integral from a purely functional point 
of view. After two memoirs occupied with geometrical questions, his (1788a) and 
(1788b), his ‘Mémoire sur les transcendantes elliptiques’ considered the new func- 
tions that the integrals define, as the sub-title makes clear: ‘Containing easy methods 
of comparing and valuing these transcendentals, which include elliptic arches, and 
which frequently occur in the application of the Integral Calculus’ .° 


In E251 Euler did go on to study more general integrands that involve square roots of quartics, but 
nothing that amount to a theory. 


Read to the Paris Academy in 1792. 
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Legendre began with a crucial simplification and three-fold classification of the 
way in which elliptic integrals can arise that has been employed by everyone ever 
since, and that Legendre rightly said in his Traité 33 years later was at the basis of his 
method. He first showed that any integral of the form [ Pax where P is a rational 
function in x and R is the square root of a quartic (with real coefficients), can be 
simplified to one of the form 


/ Odt . 
(l= #2) (1-21?) 


where Ee is another rational function. The substitution ¢ = sin @ reduces this to 


ere 
—c? sin? 4) 


Legendre called the variable ¢ the amplitude of the elliptic integral, the parameter 
c the modulus, and quantity b = 1 — c* the complementary modulus. He denoted 


the square root ,/ (1 —c? sin? d) by A or A(@), and showed how this reduction led 
to one of three distinct kinds, 


1. elliptic integrals jp are of the first kind; 
: 

2. elliptic integrals { Ad¢ are of the second kind’; and 
0 


x 
3. elliptic integrals [ Q are of the third kind. 
0 


o 

1+n sin 2 @) A 
Only in the third case did Legendre allow that the coefficient n could be imaginary, in 
which case, he observed, the integral would be accompanied by another multiplied 
by /—I, and the sum of two will always be real. This is typical of the way complex 
considerations could enter mathematical arguments at the time, but only on condition 
that the final expressions are purely real. He insisted that the modulus c and the 
amplitude ¢@ were real, and that c < 1. 

Legendre regarded all these integrals as functions of their upper end point x. He 
denoted by F a typical indefinite integral of the first kind, and a typical indefinite 
integral of the second kind by E (or sometimes G). The corresponding complete 
integrals, as they are called, he wrote as follows: 


m/2 n/2 
d 
ra [ and El= [ ado, (3.5) 
0 0 


He wrote F! (c) and E! (c) when he regarded them as functions of the modulus c. 


7The arc length of an ellipse is given by an integral of this kind. 
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Among other results, he now rederived Euler’s differential equation (3.2), in the 
form 
do _ dw 
Jl—-esinreéd VY1—c2sine wb 


He deduced that its integrals satisfy 


F@)+FW) =F), (3.6) 


where yp is an arbitrary constant. But when ¢ = 0, ~ = ps, so a comparison with 
Euler’s result shows that 


cos dcos y — singsin WA ({4) = cos LL. (3.7) 


He concluded the paper by indicating how tables of the values of F(@) could be 
calculated for specified values of the modulus c. 

It is clear that Legendre thought of himself as enriching mathematics in the man- 
ner of Euler with whole families of new, interesting functions. Legendre took the 
function-theoretic point of view even further in his Exercises de Calcul Intégrale 
(3 vols, 1811-1817). The books begin with his study of what he called the elliptic 
functions —meaning F (@), E (@), and related functions—but to avoid confusion 
with their inverses, which are called elliptic functions today, I shall call Legendre’s 
functions elliptic integrals. 

For a new function to be accepted into mathematics it has to be shown to be useful, 
and if it is to be applied then its values have to be known numerically. Legendre 
devoted Book III (1816) to the production of several sets of tables (which we shall 
consider later), and then put the new functions to work solving geometrical and 
mechanical problems.® 

In the Exercises Legendre, drawing on results in his study of elliptic integrals 
(1792), deduced by differentiating under the integral sign that the complete elliptic 
integrals satisfy two linear differential equations. The first one, 


F'=0 (3.8) 


(1 2) d’ F! i 1—3c? dF! 
c dc? Cc dc 


had already been given by Euler in 1750.? The second one is 


(1 2) Oi “ 1—c? dE! 
Cc 


E'=0. 
dc? cdc a Ge) 


8He also wrote extensively on Eulerian integrals, and computed tables for the Beta and Gamma 
functions. These had been introduced by Euler, who showed they were closely connected to elliptic 
integrals. 


°See Euler, Opera Omnia (1) vol. 20, p. 40. 
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Legendre solved these equations and exhibited power series expansions for the com- 
plete integrals F!(c) and E! (c). He also obtained a strikingly attractive result 
connecting complete integrals of the first two kinds with complementary moduli 
(c and b): 

T 

a= F! (c) E! (b) + F! (0) E! (b) — F! (6) F! (c) (3.10) 


The Pendulum and the Elliptic Integral 


In his (1792) Legendre had discussed the oscillations of a simple pendulum of 
length ¢. He set the constant c> = 2¢/h where h is the maximum height of the 
pendulum (in units where the acceleration due to gravity = 1). He defined the angle 
w as a function of the angle of displacement from the vertical, 0, by the formula 
csiny = sin(@/2), and deduced that the motion of the pendulum is given by the 
equation 

ldy 


dt = —______ 
1 —c? sin? w 


It follows that Legendre’s equation implies that the familiar approximate equation 
for small arcs under-estimates the time needed to make a complete swing. (To see 


this, compare dt and dt’ = wa and see that dt’ > dt.) Legendre was surprised 
> af 1—c?2 2 , 


to notice that he was the first to show that there were algebraic relations connecting 
the times of swings of a circular pendulum, just as there were for divisions of circular 
arcs. 

To see how Legendre obtained his equation, suppose that the pendulum (in the 
form of a weightless rigid rod of length € with a bob of mass m at the end) is set in 
motion from the vertical position with an initial velocity v. When it is at an angle 0 
with the vertical, the tangential force on the bob is mg sin 9, and so the equation for 
the rate of change of angular momentum is mb = —m g sin @. Divide through by m, 
multiply both sides by 6, integrate, and obtain 

62 
£ is gcos@ +a, 


where a is a constant determined by the initial conditions. If we take the initial 
condition to be £9 = v when 6 = 0 then we find that 


v2 


a= oy] —g. 
If we take the initial condition to be 6 = 0 when the bob is at its maximum height, 
given by cos? = 1 — h/€ then we find that 
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which with the previous result implies that 5U = gh, which is the equation for 
conservation of energy. 
So we write . 
£0 gh 
ae @—-1l4+e 
+ = geosd— 1) +5 


and make the half-angle substitution, giving cos 9 — 1 = —2 sin?(1)) and 6 = 2x), to 
obtain 


gh 
£ 2 2 le 
i = -gsin? b+ 2 
SO 
_ [gh fo 
Y=Vonvi ym? 


Set c? = 2/h and this becomes 


, gh : 
v= [opav 1 — sin’ d, 


which we write as 
dy _ v2 
dt \ 4¢2 


This can be rewritten and integrated to give!® 


dé 
t= 06 | aay 


2_ 4e9 

=a 
This was far from being his only incursion into mechanics. Legendre gave a 
thorough analysis of three problems in his Exercises (1817): the rotation of a solid 
about a fixed point; the motion of a body attracted to two fixed bodies (whether in 
plane or in space); and the attraction due to an homogeneous ellipsoid. In his Traité 
(1827) he added two more examples: motion under central forces, and the problem of 
determining geodesics on an ellipsoid, as well as two geometric problems: the surface 
area of oblique cones, and the surface area of ellipsoids. So this, his final book and 
one surely intended to be the definitive treatment of a topic that had engaged him all 
his life, was a prolonged contribution to the theory of real functions. It began with 
their definition, moved on to their fundamental properties, tabulated their values, and 
finally displayed their utility in solving significant problems in applied mathematics. 


where c 


10 As a check, note that the final result is dimensionally correct. 
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Computing Values of an Elliptic Integral 


Initially the analogy between the trigonometric and elliptic integrals led Legendre in 
a direction from which he recoiled, but which later investigators were to find highly 
suggestive. It had been known since the time of Ptolemy that to compute tables of, 
say, the sine function (or, mutatis mutandis, the chord function in Ptolemy’s case) 
one makes repeated use of the addition formulae: 


sin (u + v) = sinucosv + cos u sin v 


cos (u + v) = cosucos v — sinu sinv 
their important corollaries: 


sin2u =2sinucosu 


2 2 


cos2u = cos’ u — sin* u. 


and their inverses: 
of 1 1/2 
sin (5) = | ~(1 —cosu) 
2 2 
(5) = ee 
cos AI=\5 cos u 


together with known values of cosine and sine such as cosO = 1,sin0 = 0, 
cos 7/2 = 0, sina/2 = 1. 

Legendre showed how to calculate values of F' in Book I, §§ 17—22 of the Exercises, 
where he went on to give a table of values of elliptic integrals to 14 decimal places. 

He defined ¢,, to be an amplitude such that F (¢,) = nF (@) and sought equations 
connecting sin (¢,) and cos (¢,) with sin @ and cos ¢, under the condition that 0 < 
c < 1. The equations for cos (#2), sin(@2) and tan (¢2) came directly from the 
addition formula. Then identities between sin (¢,,41), sin (¢,—1) and sin (¢,,) enabled 
him to obtain formulae for sin (¢3), sin (¢4), and so on. 

The trisection formula, for example, relating sin (¢3) = a to sin (¢) = x, is 


(3 —4 (1 + ) x? + 6c2 x4 — ea) x 
1 — 6c2x4 + 4c? (1 + c?) x8 — 3¢4x8 = 


and this struck him as unexpectedly complicated. It is of degree 9 in x, not of degree 
3 as a naive analogy with trigonometry suggests. More generally, division into n 
parts leads to an equation of degree n?. 

This approach to constructing a table of values therefore seemed to be too difficult, 
and Legendre therefore preferred to go back to a method he had sketched in his 
(1788a) and his (1792). He had observed in his (1788a) that it was traditional in 


evaluating arc lengths along ellipses to make a substitution that transformed a given 
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ellipse into another, more circular, one. So Legendre showed how to find accurate 
approximations when c is nearly 0 or 1, using the fact that the elliptic integral is 
trivial when c = 0 or c = 1. Then he showed how to reduce the general case to this 
one by a transformations that either steadily reduced or steadily increased the value 
of the modulus. 

Legendre considered the transformation c! = — Iterating this transformation 
gives a series of values c,c’,c”,... tending rapidly to the value 1. So iterating 


it backwards gives a series of values c, C8 eC ,.. tending towards the value 0. 
l—b 


Explicitly, c? = eves, or, in terms of the complementary modulus, c? = Ts: 
For any given c successive values c, c°, c™ 
a quick use of the addition formulae. 

For example, the sequences c, c’, c”,... and c, c®, c®, ... Show that from c = 
1/2 the increasing sequence reaches 0.9999999764 in just three steps, and from 
c = 1/2 the decreasing sequence reaches 0.0000004174 in just three steps. 


Legendre defined ¢° by the equation 2 sin? ¢ = 1+ c° sin? ¢°— A® cos ¢°, where 


A° := ,/1 — c sin? 69, and found that 


,... can be found from the sine tables by 


00 


0 


F.(@) = se (°), where F°(¢°) -/{% (3.11) 


oO 


He then observed that tan (¢° — d) = btan@. So evaluating F at @ reduces to 
evaluating F 9 at ¢°, which is easier to do because is less than c, and iterating, 


F. (¢) = 


0 00 (n) 
d+c)d+e™)...d+e”) po (6). (3.12) 


Qn 


The convergence is quite rapid because, as Legendre observed, the value of c” 
becomes negligibly small after a small number of steps, and then A = | and 


F(¢) = @. Therefore he set ® = lim ae 
n—-> Oo 


and found, on setting the product 


(1 + e) (1 + en) ... = q, (it is a constant depending only on the choice of c, not 
the angle) that F = (1 + e") (1 + eo) ... = a®. Therefore F! (c) = 7a, and 
a was easy to find from logarithm tables. When ¢ = 5 the limit @ = 4, and the 
corresponding value of F is a5. 

You can check the rapidity of the convergence with this example given by 


Legendre. The value 
V2 (14+V3 comes 
cS 5 = sin75°, 


2 


is “unfavourable to calculation”, as Legendre put it, because c is close to | (it is in 
fact 0.9659258262), but Legendre showed that four iterations were enough to show 
that b°°°° vanished to seven decimal places, and log a = 0.2460561. He also showed 
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that when c is small 


1 1.3 1.3.5 
0_. -,2 oes 4 ead 6 a 
ee +(S)e +(FR)e ee 


from which is followed that it was often enough to use just the first two terms. 


3.4 Exercises 


1. Choose one or two of the following topics and consult the internet about them: 
Taylor series, MacLaurin series, Fourier series, differential, fundamental theorem 
of the calculus, elliptic integral, Lagrange, Fourier, Legendre. How intelligible, 
and how reliable, was the information you found, and how did you reach this 
decision? 

2. Read Fourier’s remarks about the convergence of his series and compare them 
with some Fourier series expansions. How do you interpret his remarks? 

3. How useful would the trigonometric functions have been if we did not have tables 
for them? What would you have to do to calculate a value of sin 1° to four decimal 
places, and how useful would such information be (think about rounding errors). 


Chapter 4 
Cauchy and Continuity 


Painting of Augustin-Louis Cauchy by J. Roller (= 1840). 


Augustin-Louis Cauchy (1789-1857) 


4.1 Introduction 


One of the key figures in the transition from 18th century analysis to modern mathe- 
matical analysis is Augustin-Louis Cauchy. Over the next several chapters we shall 
look at various aspects of his contributions to real and complex analysis, and we shall 
see that his impact is very much tied up with the complicated, and political, nature 
of his life. 
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In this chapter we begin to look at his teaching at the Ecole Polytechnique, which 
is where he introduced many of his fundamental ideas about continuity. This was a 
novel concept to stress as lying at the foundations of the calculus, and it was part of 
a wholesale reworking of the subject that he put forward after 1820. But it is also a 
difficult one, and we shall see that in his attempt to prove the binomial theorem for 
arbitrary exponents he made a revealing error that he only corrected much later in 
life, even though Niels Henrik Abel questioned it early on.' That he made an error, 
and that no-one corrected it for some time, show just how difficult the concept of 
continuity was to be to elucidate, and is one of the principal themes of this book. 
As we shall see by the time we have taken these developments as far as we can go 
(Chaps. 22-24) there are many problems connected with the concept of continuity 
that occupied the minds of many mathematicians, and it is hard to believe that Cauchy 
can have had a precise idea of what he was ushering in to mathematics. 

A particular point to watch for is the decline of verbal definitions and the rise of 
ones symbolically (using ¢, 6, and V) along with the increasing use of quantifiers V 
and J and attention to the order in which they occur; all evidence of the need for new 
levels of precision. 


4.2 The Young Cauchy 


Augustin-Louis Cauchy began teaching at the Ecole Polytechnique in November 
1815, when he replaced Poinsot, who had been in poor health. He was then twenty- 
six. The next year he was appointed a regular professor and he taught there until 
1830. 

Although Cauchy was already the author of several important papers, his appoint- 
ment as a professor was undoubtedly political.* Royalists were working across 
French society to replace supporters of the now-discredited Napoleonic regime, and 
they correctly saw Cauchy as one of their own. Monge, Lacroix and others among 
the old guard of the Ecole Polytechnique were forced to resign, and this made it easy 
to break with tradition and appoint as a professor someone who had not previously 
taught at the Ecole. A few months later the purge reached the Institut de France when 
the old Académie des Sciences was reestablished under its old name, Monge and 
Carnot were removed, and Cauchy and the obscure Louis de Bréguet replaced them. 

In these months of 1815 Cauchy won an Académie prize of 3,000 francs for a long 
paper on the theory of waves, and then resolved an outstanding conjecture of Fermat’s 
that every number is the sum of three triagonal numbers, four square numbers, five 
pentagonal numbers and generally n n-gonal numbers.° 


'Cauchy’s error has provoked much discussion among historians of mathematics, see for example 
Giusti (1984) and Schubring (2005). 


See Belhoste (1991, 45-50) upon which the next few paragraphs are based. 
3The kth n-gonal number is 5(n —2)(k2 —k) +k. 


4.2 The Young Cauchy 35 


Lagrange had proved this for square numbers in 1770, Legendre for triagonal num- 
bers in 1798, and Gauss had rederived these results in his Disquisitiones Arithmeticae 
of 1801, so to go well beyond what these men had done made Cauchy famous, and 
would have opened the doors of the Institut to him in any case, but the manner of his 
appointment turned people against him. Monge in particular was highly respected as 
a teacher and a founding father of the Ecole Polytechnique, and for Cauchy to join the 
Académie as part of what many intellectuals and scholars regarded as crude political 
manipulations without a hesitation struck many as unworthy. “Cauchy”, Bertrand 
wrote later, “found few defenders. He has seen more than one friend who, though 
naturally tolerant and decent, turned away and refused to call him ‘brother”’.* 

Royalists had long disliked the Ecole Polytechnique for its liberal and Bonapartist 
sympathies, and they seized upon a pretext in April 1815 to send the students home 
and reform the school. In September 1816 they demilitarised the school structure 
and completed the purge of the professors, thus incidentally making room to appoint 
Cauchy as a full professor, and his career prospered. 

As was customary, Cauchy was required by the Conseil of the Ecole to provide 
his students with textbooks. Accordingly, in 1821 he published the Cours d’analyse, 
devoted to algebraic analysis, followed two years later by the Résumé (1823), where 
differential and integral calculus was presented in, if not a rigorous form for the time, 
then in a form it proved possible to make rigorous.° 


4.3 Cauchy and Continuity 


In 1821, in his lectures at the Ecole Polytechnique Cauchy, defined and discussed 
a number of concepts with a remarkable, novel, degree of precision. These include: 
quantities, real functions, infinitely small and infinitely large quantities, continu- 
ity of functions, singular values of functions, real convergent and divergent series, 
rules for convergence, summation of some convergent series, the binomial theorem, 
imaginary expressions and their moduli, imaginary variables and functions, imag- 
inary convergent and divergent series, rules for convergence, summation of some 
convergent series, as well as other topics. 

Let us look at how some of these terms were defined. 

Algebraic analysis, said Cauchy, deals with numbers and quantities. It is not clear 
if the ‘quantities’ he had in mind were the abstract algebraic quantities that Lagrange 
had invoked, or something closer to the familiar measuring numbers associated with, 
for example, lengths. In any case, they may be variable or constant and are represented 
by letters. The crucial concept of limit he defined in these words (Cours, p. 4) 

When the values successively attributed to the same variable indefinitely approach a fixed 


value, in such a way as to finish by differing from it by as little as one wishes, this last is 
called the /imit of the others. 


4Quoted in Belhoste (1991, 46-47) from Bertrand Eloges académiques, 1902, vol. 2, p. 112. 
>There was only one mathematician more skeptical than Cauchy at this time, and that was Bolzano. 
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So an irrational number is a limit of a sequence of rational numbers, and a circle is a 
limit of a suitable sequence of inscribed polygons. An infinitely small number (Cours, 
p. 4) is a variable taking successive numerical values and decreasing indefinitely 
in such a way that it becomes smaller than any given number. An infinitely large 
positive number (written +00, Cours, p. 5) is a variable taking successive numerical 
values and increasing indefinitely in such a way that it becomes larger than any given 
number. Cauchy defined —oo similarly. The concept of an infinitely small quantity, 
a, of various orders a” was important for Cauchy—it is how he expressed many 
arguments involving limits—and he gave rules for handling them, noting that one 
must distinguish between decreasing constantly and decreasing indefinitely. 

It is perhaps unexpected that Cauchy tangled up the concept of a limit with talk 
of indefinitely small and infinitesimal quantities. Schubring (2005, 400) has argued 
that this has to do with a clash between mathematicians and the directors of the 
Ecole Polytechnique about how to teach the calculus. By 1810 there had devel- 
oped a general feeling among mathematicians that limits were a better concept than 
infinitesimals for this task, but the directors preferred the language of infinitesimals; 
Cauchy’s approach was therefore intended a something of a compromise that gave 
the directors the terminology they wanted and the students access to the limit concept. 
This does not involve us in accepting that Cauchy, of all people, wrote down some 
mathematics that he knew to be wrong or inadequate, because it it not simply a mat- 
ter of terminology: the use of infinitesimals has an intuitive quality that can suggest 
conclusions and proofs, and can be fairly easily replaced by limiting arguments. This 
is why the language of infinitesimals persisted well into the 20th century, especially 
in differential geometry, but it is a seductive language that, as we shall see, began to 
obscure more than it revealed and had eventually to be replaced in analysis. 

Multiply-valued expressions occur in many places in mathematics, most obviously 
in algebra, and there was a tendency throughout the 19th century to see them as many- 
valued functions. This became an issue in the early history of functions of a complex 
variable—not least for Cauchy himself—and we have already met them as integrands 
in the context of elliptic integrals, where they were to prove even more troubling. 
Cauchy introduced them in the Cours (see p. 7) by saying that, for example, there 


are two values to a? , of which the positive one is written a? or ./a. He then gave 
rules for multi-valued expressions, such as arcsin. Then he turned to give rules for 
evaluating the limits of certain expressions. 

A function is a variable that depends on another variable (Cours, p. 19), which 
is called the independent variable. Cauchy regarded uniform (i.e. single-valued) 
functions, many-valued functions (such as square or nth roots), and implicit functions 
as functions. 

Cauchy now turned to define the concept of continuity of a function (pp. 34-35). 
This was a radical step to take at a time when almost all mathematicians of the 
time took differentiability for granted: continuity had never been given a prominent 
position in the edifice of analysis before. He said that f is continuous between certain 
limits if, for each x within those limits the value of f(x) is unique and finite and 
the numerical (i.e., absolute) value of f(x + a) — f(x), where a is infinitely small, 
decreases indefinitely with a. Equivalently, an infinitely small increase in x produces 
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an infinitely small increase in f (x). This is not continuity at a point, a concept Cauchy 
never defined, but continuity on an interval. However, he did note that continuity can 
fail at a point. A little later (p. 41) he defined a function of several variables to be 
continuous on some domain if it is continuous in each variable separately.° 

Among the many examples of functions stated without proof to be continuous 
were linear functions (those of the form ax + by + cz +---) A*, sin(x), arcsin(x), 
and x. 


Theorems 


Cauchy then proved some theorems.’ 


He began with a theorem on the composition of functions (Cours, p. 41): If f 
and g are continuous within certain limits then the composite function f(g(x)) is 
continuous everywhere it is defined. 

Then came his intermediate value theorem (Cours, p. 43): If f(x) is continuous 
between xo and X, and b lies between f (xo) and f(X) then there is at least one x 
between xo and X such that f(x) = b. 

The proof that Cauchy gave in the body of the lectures is no proof at all, only 
an appeal to the geometric idea that the curve y = f(x) must cross the line y = b 
because the curve is continuous. But he was aware how to to do much better, as his 
lengthy Note of some 60 pages added to the lectures (Note 3 of 10) shows. Here he 
argued that if the interval [xo, X] of length h is divided into m equal parts then a 
comparison of successive values in the sequence of values 


Ff (x0), fo + h/m), f(xo + 2h/m),..., F(X —h/m), F(X). 
shows that there must be an consecutive pair of opposite signs. Let f(x,) and f(X1) 
be such a pair, so 


Xo < xy < X11 < X. 


Now subdivide again and repeat the argument, to obtaining a new consecutive pair 
x2 and X2 where f takes opposite signs, and note that 


Xo SX S42 < XQ SX SX. 
Continue in this fashion and two sequences are obtained: 
XO SAL SA2S MNS... 


and 
X= Xi >=XQ2z=-+-Xn=.... 


©Schubring (2005) has insisted on the point that for Cauchy a function is a function of a variable 
quantity; it will have a value at a point, but it is hard to think of a function of a variable being 
continuous at a point. 


7Cauchy’s notation has sometimes been changed, but not otherwise the statement of his results. 
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The first is monotonic non-decreasing, the second monotonic non-increasing and 
their successive terms differ by no more than 


h = X—xo, h/m, h/m’?,... 


so their terms, as he put it, “will differ by as little as desired and therefore must 
converge to a common limit”. Let a be that limit. Since the values of f on 
xo, f(%1), f (x2), ..., f(X2), f(X1), f(X) always remain of opposite sign “it is 
clear that the quantity f(a), which must be finite, cannot differ from zero.” There- 
fore there is a number a such that f(a) = 0 as required, and the theorem is proved. 

As Liitzen (2003, 167-168) pointed out, Cauchy’s otherwise exemplary account 
convergence has one fundamental flaw: he had no way of establishing the property 
of quantities (or real numbers, had he used the term) that modern analysis calls the 
completeness of the real numbers. He could not explicitly show that a sequence 
whose terms ultimately differ by arbitrarily small amounts actually has a limit and 
had to leave this point unexplained and unanalysed.® 

Cauchy then proceeded to draw out the implication of this result for the numerical 
solution of equations. 


Real Convergent and Divergent Series 


A series, said Cauchy (Cours, p. 123), (NB: this is not the modern definition) is an 
indefinite sequence ug, U1, U2, ... where the terms are given (one from the other) by 
a determinate law. It has (partial) sums s, = ug + uy, +uU2 +---+Uy_1, 1” arbitrary. 
It is convergent, he said, if the sequence of sums has a limit, otherwise divergent, and 
he gave the example of 1 + x +x? -+---, which has a sum only when the absolute 
value of x is less than 1. For a ‘series’ uo, U1, U2, ... to be convergent—by which 
Cauchy meant for the sum to be convergent—it is necessary that the u, decrease 
indefinitely as n increases, but this is not sufficient—for example, if u, = ‘ the 
series is divergent. Cauchy proved this by bracketing the terms so that it was made 
of successive runs of n terms 


1 1 
n+1 n+2 2n 
whose sum is greater than 7 
1 
A good example of a convergent series that Cauchy gave was uy, = cL. 


with the sum e = 2.7182818.... 
As he had to do if this theory was to be any use, Cauchy gave several tests for 
convergence. The first applied to series uo, v1, U2, ... all of whose terms are positive. 
The root test (Cours, p. 132): consider (un) and let k be the greatest of its limits. 
Then the series converges if k < | and diverges if k > 1. 


8More precisely, he could and did define what we call a Cauchy sequence, but he could not say in 
mathematically precise terms that it has a limit. 


4.3, Cauchy and Continuity 39 


The sum of a series formed term by term from two convergent series of positive 
terms is the sum of their sums. 

Cauchy also discussed series with positive and negative terms by passing to their 
absolute values. For them he stated and proved the ratio test (Cours, p. 143): the 
series converges if, in absolute values, Sutt has a limit k < | and diverges if k > 1. 
(Cours, p. 144) An alternating series of terms that decrease indefinitely in absolute 
value is convergent. 

He then made what has become in the literature in the history of mathematics 
his most celebrated mistake (pp. 131—132)—but we shall see that he and his con- 
temporaries regarded it differently, even when they noticed it was not correct as 
stated. 


Cauchy’s Attempt at the Binomial Theorem 


Cauchy now considered the sequence 


ug(x), uy(x), Sy Un (x), eins 
of functions defined on a given interval, and its partial sums s_)(x) = Pe, un. He 
supposed that the series converged to s(x), and defined the nth remainder r,,(x) by 
the equation 


S(X) = 8y(X) + rp (X). 


He then offered this theorem with the accompanying proof (Cours, pp. 131-132): 


When, the terms of series having the same variable, 


ug(x), Uy (X),..-, Un(X),... (4.1) 


the series is convergent, and its different terms are continuous functions of x, in the neigh- 
bourhood of a particular value attributed to this variable, then 


Sn, Tm, ands 


are also three functions of x, of which the first is evidently continuous with respect to x in 
the neighbourhood of the particular value under consideration. This done, let us consider 
the increases that the three functions receive when one increases x by an infinitely small a. 
The increase in s, will be, for all possible values of n, an infinitely small quantity; and that 
of r; becomes insensible at the same time as r, if one gives n a very large value. Therefore 
the increase in the function s can only be an infinitely small quantity. From this remark one 
immediately deduced the following proposition. 


Cauchy then proved a theorem on the term by term multiplication of series, which 
he used along with the above theorem to prove the binomial theorem (pp. 164—165): 
(1 + x) can be developed as this power series (when convergent): 


-—1 
1 ee) 2 


1 
- 1 1.2 
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He gave this proof: consider the sum of the series ® (4) = 1+ 4x+ Mu oe ee 
as a function of yu. It is a continuous function of 4, when —1 < x < +1, and one has 
@(p1).B(') = B+ p’) (by the theorem on the product of series). This functional 
equation for ® is solved as follows. 


®(m) = &(1)” = dd +x)” 


for all integer values of m and therefore for all rational values of m and therefore, 
because ® is a continuous function of ju, for all values of ju. 

A clear counter-example will show that Cauchy’s argument about the sum of 
a convergent series of continuous functions necessarily converging to a continu- 
ous function is wrong. Consider the continuous functions f,,n = 1,2,... that are 
defined as follows. For each positive integer n, 


=1 ifx<= 
fnx)=ynx ifot<x<? 
il hex 


Define u;(x) = fi (x), and uy(x) = fn(x) — fr-1(x) when n > 1, so 
Sn(X) = u(x) + g(x) +--+ + Un(X) = fn(x). 


The individual terms of the series, u,(x), are the difference of two continuous func- 
tions, and are therefore continuous everywhere, so they meet the requirement of 
being continuous in the neighborhood of an arbitrary point, in particular the point 
x =0. 

The sum of the series is the limit s(x) = limy-+oo 5, (x), and it is the following 
function 


-1 ifx <0 
s(x) = 7 0 ifx =0 
+1 ifx>0 


which is continuous except at x = 0. 
Now consider the remainder r,(x) = s(x) — s,(x). Outside the interval -i < 


x< i the remainder is zero. Inside this interval it is defined by 


-l-n ifst<x <0 

Tn(x) = 40 ifx =0 
l-nx if0<x<t 
It is true that as n becomes indefinitely large the remainder term becomes zero 
for each value of x, but this does not happen for all x simultaneously as n becomes 
indefinitely large. On the contrary, for each n there are values of x for which the 
remainder takes the value 1 (and others for which the remainder takes the value —1). 
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Evidently, with a modern understanding of the terms Cauchy has made a mistake. 
Indeed, we have to read the statement of the “Theorem’ charitably, because it invites 
the interpretation that a series is known to converge in a neighbourhood of a point 
if it is known to converge at the point, which is trivially wrong (it is enough to take 
Sn (x) = nx and follow the above counter-example). 

It is clear, on looking at Cauchy’s argument, that he was unclear about what it is 
for the terms r,(x) to become arbitrarily small. Does it mean that 


for all ¢ > O there is an N such that for all x in the interval under consideration 
n > N implies |r,(x)| < €? 

Or does it mean that 

for all ¢ > O and for all x in the interval under consideration there is an N such that 
n > N implies |7,(x)| < €? 


For our example the first claim is false, but the second claim is true, which shows 
that there is a distinction to be made here. However, Cauchy did not do so. Instead, 
he passed to the limit as n tended to infinity without considering that different terms 
of the series might behave differently, which our modern argument does. So it is 
interesting, and perhaps surprising, to see what happened at the time. 

The first person to comment was Abel, who took as his guide, he said (Abel 1826, 
221) “the excellent work of M. Cauchy “Cours d’ analyse de I’ école polytechnique”, 
which must be read by every analyst who aims at rigour in mathematical research”.? 
Abel observed (1826, 225) that 


it seems to me that the theorem admits exceptions. For example, the series 


1 1 
sin x sin 2x + — sin3x 
2 3 


is discontinuous for every value (2m + 1) of x, m being a whole number. There are, as one 
knows, many series of this kind. 


The series Abel gave had been known since Fourier’s time to be copies of the 
function y = ; between —7 and 7 and discontinuous, as Abel said, at all odd 
multiples of 7, where it takes the value zero, and presumably Abel took this example 
from Fourier’s work. It had indeed been known to Euler, who discussed it in 1783, 
and it had been picked up and commented on by Lacroix in his book of 1810. For 
pictures of the graph of this series, see Sect. 4.4. 

It is likely that Abel chose his words carefully to indicate that Cauchy’s argument 
was essentially correct but wrong in a way that could be put right, or, in other words, 
that Cauchy’s argument would be true in most cases but fail in some anomalous 
circumstances that had to be understood. For a discussion of Abel’s views in a 
context of the transition from formal, algebraic, analysis to concept-driven analysis, 
with an indication of how other mathematicians whom Abel knew saw exceptions 
to theorems, see the interesting account (Sgrensen 2005). We shall look at Abel’s 
attempt at providing a rigorous proof shortly. 


°Page references are to Abel’s Oeuvres completes, Vol. 1. 
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Certainly Cauchy knew this work, but he did not react when Abel published these 
remarks, and the obvious implication is that he did not consider them to be relevant 
to his ‘Theorem’? On the other hand, Abel did think this series was an exception. 

Let us look at the series in a neighbourhood of an odd multiple of 7. Each term is 
a sine function and therefore continuous everywhere. Each term takes the value 0 at 
every odd multiple of 7, so the series converges to zero at all odd multiples of 7. 

On any closed interval not containing an odd multiple of 7 the convergence is 
uniform. But plainly the sum is continuous only away from the odd multiples if 7. 
Abel read Cauchy’s ‘theorem’ as saying that if the series is convergent at a point xo 
and the individual terms of the series are continuous on a neighbourhood of xo then 
the series is also continuous on that neighbourhood. 

One way to focus this conflicting set of views is to stress that Cauchy never 
expressed a concept of continuity at a point, only on an interval—which gives him a 
concept more like uniform continuity. If you also take him to mean by convergence 
on an interval what was later meant by (and what today we mean by) uniform conver- 
gence, then his ‘Theorem’ is actually correct. Abel, on the other hand, may well have 
taken convergence to be what we mean by it today, convergence at a point, in which 
case Cauchy’s ‘Theorem’ is wrong and his counter-example shows that. However, 
what Abel wrote when defining continuity and convergence is almost identical to 
what Cauchy wrote. Thus (Abel 1826, 220): 


An arbitrary series 


will be said to be convergent, if for continually increasing values of m, the sum vg + vy + 
v2 +-+++ Um indefinitely approaches a certain limit; This limit will be called the sum of the 
series. 


and (Abel 1826, 223): 


A function f(x) will be said to be a continuous function of x between the limits x = a and 
x = bif, for an arbitrary value of x between these limits, the quantity f(x — (3) indefinitely 
approaches the limit f(x) for continually decreasing values of 3. 


It looks as if they wrote the same words and could have meant something different 
by them. 

The picture becomes even more complicated if we look at how Abel himself 
tackled the binomial theorem.!° He replaced Cauchy’s faulty theorem with 


Theorem IV: If the series 
f(@~= vo + va + we? +++ up, be 


is convergent for a certain value 6 of a, it will also be convergent for every value 
less than 6, and if one supposes that a is less than or equal to 6, then the function 
f(a— £3) will approach the limit f(a) indefinitely for continually decreasing values 


of (3. 


10Here we follow Liitzen (2003, 178). 
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He followed this with 


Theorem V: Let 
vo + vid + 0267 + --- 


be a convergent series in which vo, v1, v2, ... are continuous functions of the same 
variable quantity x between the limits x = a and x = J, then the series 


f(a) =v + vja+Wa*+--- 


where a < 6 will be convergent and a continuous function of x between the same 
limits. 

Theorem V can be used to replace Cauchy’s faulty theorem and thereby provide 
a valid proof of the binomial theorem. However, Abel’s proof of TheoremIV does 
not make clear where the uniform convergence comes into play, and in any case he 
did not use that property in the crucial Theorem V. As Liitzen points out (2003, 178), 
Kronecker was to observe correctly that Abel’s proof is flawed at this point. 

It is also worth noting, as Viertel has pointed out (2012, 57-59) that when Enne 
Heeren Dirksen reviewed the German translation of Cauchy’s Cours d’analyse in 
1829 he explicitly recorded his doubts about this theorem, writing in the third person 
that “the proof is not sufficient, and it seems to him on a closer consideration that 
the validity of the theorem itself is in doubt”.!! He came to this opinion, or at least 
was able to express it, because he handled the equation with more formalism and 
less reliance on words than Cauchy had used, and he used his notation to show that 
similar reasoning to Cauchy’s could be used to show that 0 = 1. But he did not 
produce a counter-example to Cauchy’s claim. 

Dirksen is described in Schubring (2005, 471-475) as an adherent of the German 
combinatorial school who took his Ph.D at Géttingen in 1820 and then went to Berlin, 
where he became a professor in 1824. He worked for a long time on a long, and 
largely unpublished, manuscript called the Organon, and in it he defined continuity 
of a function this way: 


If x designates an independent variable, f(x) an appropriately defined [function], and xo a 
specific value of x, it is said that f(x) is continuous for the specific value x9 and remains 
so insofar as its special value for xo of x is completely determined by x and is equal to the 
value of the limit of f(x) for the limit xg. 


As Schubring observes, this is quite clearly continuity at a point. He goes on to remark 
that Dirksen, influenced by his mathematical training, which emphasised algebraic 
formalisms, was the first person to devise a notation that allowed mathematicians to 
handle two limit processes at once (n — oo, x — 0), and so to produce the example 
that Viertel was to discuss later. 

Another interesting perspective on this is provided by Dirichlet’s investigations, 
which we shall discuss in later chapters (see Chap. 14). 


'l Quoted in Viertel (2012, 57 n. 55). 
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Very importantly, Cauchy himself returned to this question in 1853, when Charles 
Briot and Claude Bouquet, two young French mathematicians whom he supported, 
questioned the ‘Theorem’, and simply admitted he had been wrong. As he put it, 

As MM. Briot and Bouquet have remarked, this theorem can be proved for power series 


arranged in ascending powers of a variable, but for other series it cannot be admitted without 
restriction. 


He now argued that 


If one calls n’ an integer larger than n the remainder r,, will be nothing other than the limit 
to which the difference 


Sn! — Sn = Un + Ung Pr Uy (4.2) 


will converge for increasing values of n’. Now let us suppose that in attributing to n a 
sufficiently large value one can make, for all values of x within the given limits, the modulus 
of (4.2) (whatever n’ may be) and therefore the modulus of r, less than a number ¢ that is 
as small as one wishes. As an increase attributed to x can still be supposed to be sufficiently 
close to zero for the corresponding increase in s, to have a modulus less than a number as 
small as one likes, it is clear that it is enough to attribute an infinitely great value to the 
number v and an infinitely small value to the increase in x, to establish the continuity of the 
function 

S=Sn+Tn 
between the given limits. 
But this proof evidently supposes that expression (4.2) satisfies the condition stated above, 
i.e. that this expression becomes infinitely small for an infinitely large value attributed to the 
integer n. Moreover, if this condition is met the series (4.1) will evidently be convergent. 
Consequently, one has the following theorem 


Theorem 1 If the different terms of the series 
uo(x), U1 (X),.--,Un(X),.-- 


are functions of the real variable x that are continuous, with respect to this variable between 
given limits, if, moreover the sum 


Un +Untt +--+ +Uy/-1 


always becomes infinitely small for infinitely great values of integers n and n’ > n, then the 
series (4.1) will be convergent and the sum of the series (4.1) will be a continuous function 
of the variable x between the given limits. 


One might still wish for a little more precision, but the sentiment is clear: Cauchy 
now inserted a clause that makes the appropriate values of n and n’ depend on x, 
which clears up his earlier ambiguity. He then showed that Abel’s counter-example 
does not apply to the new theorem. !” 

The most likely explanation for this confusion is that everyone was confused. The 
natural idea of continuity on an interval, which might be taken to mean uniform con- 
tinuity in modern parlance, was not perhaps properly distinguished from continuity 


'2Tn view of the fact that some mathematicians have attempted to interpret Cauchy’s analysis using 
modern non-standard analysis, which can make Cauchy’s “Theorem’ correct, it is worth noting that 
Cauchy still used the language of infinites in 1853, but now to rectify his previous claim, which 
makes it unlikely that he had changed his mind about infinites and infinitesimals. See Chap. 22 
below. 
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at a point, which is not a natural intuition to have. Cauchy had perhaps thought, in 
1821, that the theorem was true except when it was obviously false, and in 1853 
had decided that such slipperiness would not do. But the clearest thing is that even 
these mathematicians found the matter confusing, and were not, in the end, entirely 
consistent or clear. The lack of a good notation, and the heavy reliance on verbal 
statements, obscures delicate questions such as the dependence of r;,(x) on both n 
and x, and this seems to have allowed mathematicians to misunderstand the problem 
and to have been misunderstood by their contemporaries. 

We shall return to this question when the concepts of continuity and uniform 
continuity began to be more clearly expressed and disentangled in the 1840s and 
1850s (see Chap. 22). 


4.4 Abel’s ‘Exception’ and Cauchy’s Series 


Abel’s Series 


Abel’s exception to Cauchy’s theorem is the function 


[oe 


_y)ktle 
Ab(x) := pe 
1 


It is the Fourier series for the function y = x/2 from —7 to 7. Figure4.1 is a graph 
of the first 10 terms of this series. 

Figure 4.2 is a graph of the first 100 terms of this series: Notice how nearly straight 
the diagonal parts are—they are in fact very wiggly smooth curves that oscillate fast 
but not by very much around the eventual limit curve. Notice too how unhappy Maple 
gets at the points where the Abel function will be discontinuous. 

Figure4.3 is a graph of the result of differentiating the first 100 terms of this 
series term by term. It oscillates wildly, and is not at all like the derivative of the 
limit function (which is the constant function 1/2 everywhere that the Abel function 


Fig. 4.1 The first 10 terms 
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Fig. 4.2. The first 100 terms 
of the series for Ab(x) 15 
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Fig. 4.3. The first 100 terms 204 
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is differentiable). This hints at how the theory for the differentiation of series will be 
much more complicated than that for continuous functions. 


Cauchy’s Series 


Here for comparison are graphs of the first 10 terms (Fig. 4.4) and the first 100 terms 
(Fig. 4.5) of the very similar series Cauchy used to illustrate his mistake: 


Co 


Cas x alc 
1 


In 1853 Cauchy knew that this series does not converge uniformly. The next graph 
Fig. 4.6 shows this clearly. It shows the difference between the sum of the first 500 
and the sum of the first 30 terms, so it gives an impression of the tail 


. sin (kx) 
= 
30 


It is very clear that the convergence is unlikely to be uniform on the interval (0, 277). 
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Fig. 4.5 The first 100 terms 
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A point worth exploring here is this. Consider a small neighbourhood of the origin, 
one that does not contain an odd multiple of 7. To be sure, Abel’s series converges 
uniformly here. A graph of the first NV terms, say for N = 10,00,000, will look 
indistinguishable from a straight line. Does this mean that the slope of the derivative 
is very nearly constant (say equally indistinguishable to the naked eye)? Term by 
term differentiation of the truncated series involving just the first N terms is certainly 
allowed. Consider now the behaviour of the series obtained from Abel’s series by 
forgetting the Nth term. It looks very like the full Abel series, but differs from it by 
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a function that oscillates very fast indeed, going through N = 10,00,000 oscillations 
as x goes from —7 to 7. This hints at the fact that to require a series of continuous 
functions to converge to a continuous function is not asking very much of the limit, 
but to a series of differentiable functions to converge to a differentiable function is 
asking rather more. 

Precisely this point was made by Abel in a letter to his friend Bernt Michael 
Holmboe in a letter on January 1826.'* He wrote 


One applies all operations to infinite series as if they were finite, but is this permissible? I think 
not. — Where is it proved that one gets the differential of an infinite series by differentiating 
each term? It is easy to give an example for which this is not true, e.g. 


1 1 
xX = sinx sin 2x + = sin 3x 
2 3 


Differentiation gives 


las cos 2x + cos 3x — ---etc. 


a result which is quite false because this series is divergent. 


13 Quoted in Sgrensen (2005, 468). 


Chapter 5 
Cauchy: Differentiation and Integration 


5.1 Introduction 


Here we look at Cauchy’s treatment of differentiation and integration. Cauchy not 
only reversed a century of tradition that regarded integration as essentially the inverse 
of differentiation, he brought a raft of new ideas and theorems to the subject. His 
foundational ideas did not simply underpin existing theory and leave mathemati- 
cians free to go about their business as before, they forced a wholesale rewriting 
of the subject that required new methods and markedly raised standards of rigour.! 
Examples of this that we discuss in this chapter include his mean value theorem 
and the proof that continuous functions are integrable. Moreover, although Cauchy’s 
emphasis was on what can be done and on what can now be properly established, by 
defining what a continuous function is, what it is for a function to be differentiable 
or to have an integral, Cauchy opened the door to the idea that functions might fail 
to have these properties, an impression that was strengthened by his insistence that 
functions defined by series may only have these properties on restricted domains. 


5.2 Cauchy’s Criticisms 


Cauchy’s Avertissement or preface to his Résumé contains the following striking 
passage: 


The methods that I have followed differ in many respects from those found in works of 
the same kind. My principal aim has been to reconcile rigor, which I had made a law unto 
myself in my Cours d’analyse, with the simplicity that results from the direct consideration of 
infinitely small quantities. For this reason, I have thought it necessary to reject the expansions 
of functions in infinite series whenever the series thus obtained are not convergent; and I 


'T am indebted to Jesper Liitzen for stressing to me in conversation what he calls the ‘new archi- 
tecture’ of mathematical analysis that Cauchy introduced, a point he makes in his (2003, 161). 


© Springer International Publishing Switzerland 2015 49 
J. Gray, The Real and the Complex: A History of Analysis in the 19th Century, 
Springer Undergraduate Mathematics Series, DOI 10.1007/978-3-3 19-23715-2_5 


50 5 Cauchy: Differentiation and Integration 


have been forced to return Taylor’s formula to the integral calculus. This formula can no 
longer be admitted as general unless the series that it includes is reduced to a finite number 
of terms and completed by a definite integral. I am not unaware that the illustrious author of 
the Mecanique analytique [Lagrange] took the formula in question as the basis of his theory 
of derived functions. But, despite all the respect that I have for such a great authority, the 
majority of geometers now agree in recognizing the uncertainty of the results to which one 
can be led by the use of divergent series, and I add that in many cases the Taylor theorem 
seems to furnish the expansion of a function in convergent series, even though the sum of 
the series essentially differs from the proposed function. For the rest, those who read my 
book will I hope be convinced that the principles of the differential calculus and its most 


important applications can easily be set out without the use of series (Cauchy, Résumé, pp. 
9-10). 


Despite the polite, diplomatic language, Cauchy here made it clear that he was 
offering a profoundly different account of the fundamental processes of the cal- 
culus. Compare it with what Lagrange had published only a few years earlier, in 
a supplement to the lessons he had given on the theory of functions at the Ecole 
Polytechnique?: 


Every function of one variable only can always be regarded as an exact derivative; for, if it 
does not naturally have a primitive function, one can always find one by series ...by resolving 
the given function in a series of powers of the variable and then taking the primitive function 
of each term. 


In the same volume of the Journal de I’Ecole Polytechnique that contained 
Lagrange’s supplement, André-Marie Ampére, after whom the amp is named, pub- 
lished a long note (Ampére 1806) in which he claimed to give “a new demonstration 
of the Taylor series”. He disagreed with Lagrange’s assumption, but nonetheless 
sought to prove that a continuous function is differentiable, indeed infinitely dif- 
ferentiable, and that every function f(x) can be expanded in a series of increasing 
powers. He began by giving a new definition of the derivative, which he stated in 
these terms*: 


The derived function of f(x) is a function of x such that 2s is always included 
between two of the values that this derived function takes between x and x + i, whatever x 


and i may be (see footnote 4). 


This definition was, he said, ‘the most general and the most rigorous possible’, and 
he justified it by basing it on a property of derived functions found by Lagrange and 
which he stated this way: 


Let A and K be the values of f(x) corresponding to x = a and x = k. One can always 
suppose that a and k are chosen in such a way that f(x) never becomes infinite in the 
interval and that neither does A equal K nor does a equal k, so that fs is neither zero nor 
infinite, and that one can take this quantity for that above or below which [i.e. as an upper 
or lower bound, JJG] one can always take (f(x +i) — f(x))/i, giving toi a value as small 


as necessary. 


2Quoted in Bottazzini, The Higher Calculus, 118. 
3See Lagrange (1806, 364), quoted in Bottazzini, The Higher Calculus, 118. 
4See Ampére (1806a, 151), quoted in Bottazzini, The Higher Calculus, 119. 
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Cauchy respected Ampére’s work and his lectures at the Ecole Polytechnique, 
he even taught courses with him—they alternated in alternate years—and he cited 
Ampére among those to whom he is indebted in both the Cours d’analyse and in 
the Résumé. But, unpersuaded as he was by the whole idea of Lagrange’s approach, 
Cauchy reversed the whole formulation of the problem. He defined the derivative of 
a function (if it exists) as a suitable limit, with the result that Ampére’s definition 
became a property of the derivative, expressible as a theorem. 


5.3 Cauchy’s Definitions 


Differentiation 


Cauchy began by repeating the definitions of limit, infinitesimal, and continuity from 
the Cours d’analyse. Then he wrote 


When the function y = f(x) remains continuous between two given limits of the variable 
x and one assigns to this variable a value included between the two limits in question, an 
infinitely small increase in the variable produces an infinitely small increase in the function 
itself. As a consequence, if one then sets Ax = i, the two terms of the ratio of differences 
a = fat) ie) will be infinitely small quantities. But, while these two terms will indef- 
initely and simultaneously approach the limit zero, the ratio itself can converge towards 
another limit, either positive or negative. This limit, when it exists [my emphasis, JJG], has 
a determinate value for every particular value of x, but it varies with x. ...The form of the 
new function that will serve as the limit of the ratio LGt)-f@) will depend on the form of 
the proposed function y = f(x). To indicate this dependence, we give the new function the 
name of derived function, and designate it, with the aid of an accent, by the notation y’ or 
f'@? 


In his next lecture Cauchy connected this with the familiar am He defined the 


differential of the function f(x) as “the limit towards which the first member of the 
equation 


f(x +ah) — f(x) _ f@+D-f@), 


a 1 


where i = ah converges when the variable w approaches indefinitely close to zero 
while the quantity / remains constant” (Résumé, p. 27). When f (x) = x, the equation 
df (x) = hf'(x) reduces to dx = h, whence df (x) = f’(x)dx, or, equivalently, 
dy = y'dx. Therefore, said Cauchy, the first derivative can be written as dy /dx, that 
is, as the ratio between the differential of the function and that of the variable. 

Cauchy’s thorough-going re-write of the foundations of the calculus is based on 
the next result, which is his mean value theorem.° 

If, the function f(x) being continuous between the limits x = x9, x = X, one denotes by 


A the smallest and by B the largest of the values that the derived function f’(x) assumes in 
this interval, the ratio of finite differences 


5Cauchy, Résumé, pp. 22-23, in Bottazzini The Higher Calculus, 120. 
®See the Résumé, pp. 44-46. 
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f(X) — f (xo) 


X — xo 


(4) 
will necessarily be contained between A and B. 


He gave this proof. 


Let us denote by 45, ¢ two very small numbers, the first being chosen in such a way that for 
numerical values of k less than 6 and for an arbitrary value of x between xo and X the ratio 


fx +k) — f(x) 
k 


is always greater than f’(x) — € and always less than f’(x) + ¢. If between these limits one 
interposes n — | new values of the variable x, let us say 


X1,%2,-+-,Xn—-1 
in such a way as to divide the difference X — xo into elements 
X1 — X0,X2 —X1,...,X —Xn-1 
which, all being of the same sign, have numerical values less than 6, the fractions 


f(%1) — fo) f(x2) — fr) f(X) — f@n=1) 


og 


x1 — x0 x2 — X1 X —Xn-1 


(5) 


will be found, the first between the limits f’(xo) — ¢, f’(xo) + €, the second between the 
limits f’(x1) — €, f’(x1) +6, ...all being greater than the quantity A — ¢ and all less than 
the quantity B + ¢. (Here, Cauchy used the result that if ¢ < § then § < {£5 < § which 
was better known then than now.) Moreover, the fractions (5) having denominators of the 
same sign, if one divides the sum of their numerators by the sum of their denominators one 
obtains a mean fractions, that is to say one lying between the smallest and the greatest of 
the ones considered (see Analyse Algébrique, Note Il, Theorem XII). The expression (4), 
with which the mean coincides, will therefore itself be contained within the limits A — ¢ and 
B + e, and as this conclusions persists however small the number ¢ shall be, one can say 


that expression (4) will be contained between A and B. 


Cauchy immediately deduced that 


Corollary If the derived function f (x) is itself continuous between the limits x = xo, x = X, 
in going from one limit to the other the function varies in such a manner as to always remain 
between the two values A and B and successively takes all the intermediate values. Therefore 
any mean quantity between A and B will be a value of f’(x) corresponding to a value of x 
between the limits x9 and X = xo + h, or, which comes to the same thing, a value of x of 
the form 

xo + Oh = x9 + O(X — x0), 


@ denoting a number less than unity. In applying this remark to expression (4), one concludes 
that between the limits 0 and | there is a value of 6 that satisfies the equation 


X)— f(x 
FOO) = FOO) pty. 4 0(X —19)] 
X — xo 
or, which comes to the same thing, the following: 


F(xo +h) = Fo) _ 
h 


f' (xo + 9h) (6) 
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This last formula being substituted whatever the value of x represented by xo provided that 
the function f(x) and its derivative f’(xo) remain continuous between the extreme values 
xX = XxX0,x = x9 +/A, one will generally have, under this condition, 


fix+h)— f(x) 
h 


= f'(x + 6h) (7) 


whence, on writing Ax in place of h, one deduces 


f(x + Ax) — f(x) = f'(x + OAx)Ax. (8) 


It is essential to observe in equations (7) and (8) that 6 always denotes an unknown number 
less than unity. 


Grabiner (1981, 115) has observed that in the course of proving this theorem, 
an interesting shift occurred away from the language of infinitesimals or limits and 
towards the arithmetic of inequalities when Cauchy began his proof by writing 

Designate by 5 and ¢ two very small numbers: the first being chosen in such a way that, 


for numerical values of i less than 5, and for any value of x between xo and X, the ratio of 
(f(x +i) — f(x))/i always remains greater than f’(x) — € and f’(x) +. 


This is the first appearance of the delta—epsilon notation, although it is modelled on 
more verbal expressions of the same kind in the Cours d’analyse, and it became a 
growing presence in Cauchy’s work.’ 

This proof merits some careful discussion.’ Cauchy assumed that a single choice 
of 6 and ¢ could be made which works simultaneously for all of x1, x2, ..., X,—1. 
This will not always be the case. He also assumed that the derived function f’(x) 
is bounded in the interval. This too is not always the case. These assumptions can 
be made to hold if the derivative is assumed to be continuous in the interval [xg, X]. 
Contrariwise, when Cauchy does assume the continuity of the derivative, in the 
corollary, he need not have done so. The intermediate value property holds for the 
derivative even if it is not continuous (a result later proved by Darboux). Cauchy’s 
genuine insights nonetheless left quite a delicate muddle to sort out! 

These two examples, taken from Bressoud (1994), illuminate these subtleties. 


1. Bressoud (1994, 89): The function f(x) = x’ sin(x~?) is differentiable in 
[0, 1] but there is no ¢ for which there is a 6 such that |x — a| < 6 implies 


LO-F@ _ FI(x)| < € for all a € [0, 1]. 


x—a 

2. Bressoud (1994, 81): The function f(x) = x43 sin(1/x) is differentiable in 
[—1, 1] but the derivative is not bounded there. Note, by the way, that for this 
function f’(0) = 0, contrary to what any attempt at graphing it might suggest 
and consistent with the fact that the formal expression for the derivative does not 
makes sense on substituting the value x = 0. This underlines the fact that the 
derivative is defined not as an expression but as a limit. The expression does have 
a limiting value as x —> 0—but you must prove that. Check that the derivative is 
not continuous at x = 0. 


7See also Bottazzini, The Higher Calculus, 120. 
8See Bressoud (1994, 81). 
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Cauchy was also aware that there are problems with quantities defined as limiting 
values, and commented that singular values, those obtained only as a limit, may 
present real difficulties, as for example with x* at x = 0. He therefore gave a 
number of tests for convergence appropriate to that delicate case. 

Lagrange had started his account with a (flawed) argument to show that every 
function admits a Taylor series expansion. Cauchy’s much more careful defence of 
this idea was restricted to functions that, with their first n derivatives, are continuous 
within the z-interval [0,4], and it comes at the conclusion of his lectures on the 
infinitesimal calculus (Résumé, pp. 211-219). There he gave what is nowadays known 
as Cauchy’s form of the Taylor series with remainder: 


(n—1) hn 


/ h? " h (n—1) 
eee eae ar | (x) +-:-+—_—__f (x) + 


(n) 
GD! a7 fe en): 


O<@<l. 
The remainder can also be written as an integral: 


x+h _5yn 
fo Sow 


and he observed that when the integral tends to zero with increasing values of n it 
serves to expand the function f(x + h) in series ordered according to the ascending 
integral powers of the quantities x and h. 

Cauchy concluded with a remarkable observation that is the first hint that the world 
of continuous, and even differentiable, functions may be much larger than the world of 


functions defined by power series. If the Taylor series f (0) +x f'(0)+ a f"(O)+:-: 
is convergent, then one might think that its sum is the function f (x), and in particular 
if all the terms of the Taylor series vanish than the function itself vanishes—but this 
is not necessarily the case. In fact, he said,” 


to be certain of the contrary it is sufficient to observe that the second condition will be 
fulfilled if we suppose f(x) = e~!/*, and the first if we suppose f(x) = en* + ex), 
However, the function e~“/ xy is not identical to zero, and the series derived from the last 
+ eU/*) as its sum, but its first term eW lx), 


supposition does not have the binomial ex 

The observation is profound. Hitherto, every mathematician had believed that 
every function can be expanded in a Taylor series, trivial cases aside, and that their 
task was to prove it—if indeed they felt that a proof was called for. Cauchy had now 
shown that it was possible to define a function that did not agree with its Taylor 
series. This not only destroyed the foundations of the Lagrangian calculus, it opened 
up the much more disturbing question of the relationship between a representation of 
a function and the function itself. By then Cauchy may well have known that much 
graver problems were to present themselves with the representation of a function 
by its Fourier series. Finally, this example suggested that functions may not, after 


°Cauchy Résumé, 229-230. 
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all, submit naturally to the operations of the calculus. Cauchy’s new definitions 
and theorems opened up a domain that only they were precise enough to allow 
mathematicians to explore. 


The Definite Integral 


In the century since Newton and Leibniz the fundamental theorem of the calculus 
had become regarded as allowing the integral of a function to be regarded as the 
opposite of its derivative; the integral of a function f(x) is a function F(x) with the 
property that iF (x) = f(x). As such, it is determined up to an additive constant. 

Cauchy firmly reversed this trend, and restored an independent existence on the 
integral. In the second part of the Résumé of 1823 he defined the integral as a limit 
of sums of areas. 

The Cauchy integral of a function of a real variable y = f(x) that is continuous 
in a given interval was defined as follows. He divided the interval into n equal 
subintervals [x;_1, x;], and considered the sum S = >°,(4; — x;~1) f (xi-1). The 
integral will be the limit of this sum as n tends to infinity. 

To prove that this limit exists, Cauchy considered a different sum: 


S’ = DiGi — xii) f Gi-1 + 6-10 — xi-1)). 


In this expression, the function f is evaluated at the point where it takes its average 
value (by the mean value theorem). Now, 


f (int + 67-1 — Xi-1)) = fOG-1 + &), 


for some ¢;. So 


S = 0G — mi) F Gi) + Do ei — 1-1). 


The second part of the sum can be made arbitrarily small because f is continuous, 
so every € will ‘differ very little’ from zero. Here, not for the first time, as we have 
seen, Cauchy failed to see that choosing a 5 that works simultaneously for every 
subinterval (and depends only on the ¢) cannot be guaranteed but must be proved to 
be possible. In fact, as later mathematicians were to show, this can be done because 
a continuous function on a closed interval is uniformly continuous. 

By repeating the reasoning and implicitly reverting to the ‘Cauchy criterion’ that 
he had demonstrated in his Cours d’analyse Cauchy concluded!” 

if we let the numerical values of these elements decrease indefinitely by increasing their 

number, the value of S will end by being sensibly constant or, in other words, it will end 

by attaining a certain limit that will depend uniquely on the form of the function f(x) and 


the extreme values xo, X attributed to the variable x. This limit is what we call a definite 
integral. 


'0Cauchy, Résumé, p. 125, Bottazzini, The Higher Calculus, 144. 
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By later standards, Cauchy should have considered the effect of dividing the 
interval into unequal pieces, which he did not. But he did consider certain kinds of 
discontinuous functions, those in which the function becomes infinite at one or more 
points of the interval (his argument allowed finitely many such points). So it is not 
correct to say, as the literature sometimes does, that Cauchy defined the integral only 
for continuous functions. 

Cauchy’s method was to consider the obvious limit, and to say that the function 
has an integral if this limit exists (Cauchy Resumé, p. 146). More precisely, let us 
suppose that the function f(x) is defined on the interval [x9, X] and continuous 
there except at the interior point x = a. Then it makes sense to consider the integrals 


hg Sf (x)dx and ae f (x)dx and their sum 


xX 


i f(x)dx + t(x)dx. 


0) a+é 


Cauchy defined the limit of this sum when it exists as ¢ — 0 as the principal value 
of the integral f 3 and said it was a singular integral. 

There are two ways to think of this. One is that mathematics is about functions. 
Something one does with functions is integrate them, and here is a way of integrating 
some functions that do not immediately seem to be amenable to integration. The 
second way is that mathematics is about operators, such as integration, that operate 
on functions, and here is a way of making that operator work on a wider class of 
functions. The difference in points of view may not be very apparent in this example, 
but we shall see that analysts were at least as concerned with getting a theory of the 
integral that worked well as with understanding functions. 


The Indefinite Integral 


This work led Cauchy to the concept of the indefinite integral. In the Avertissement 
to the Résumé Cauchy had written!! 


it seemed to me necessary to demonstrate the existence of the integrals or primitive functions 
in general before making their various properties known. To this end, it was first necessary 
to establish the idea of integrals taken between given limits or definite integrals. 


Now that Cauchy had defined the integral without reference to differentiation, he 
had to reconsider the formula that is sometimes called the fundamental theorem of 
the calculus: 


b 
/ f(x)dx = F(b) — F(a), 


where F’(x) = f(x). 
Cauchy argued as follows. Set T(x) = Hs Ff (x)dx, so 


T(x) = (X — x0) fo + A(X — x0)), |@| < 1. 


Résumé, 10. 
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Then 
X+a Xx 
/ St (x)dx -{ f(x)dx =af(x + a6), 
x x0 


0 


whence 
T(x+a)—T(x) =af(x+ 6a). 


If f is finite and continuous in the neighbourhood of a point x, the same will be 
true for T, and by passing to the limit as a — 0, Cauchy deduced that T’(x) = f (x). 

Cauchy then showed that, if w’(x) = 0 then w = constant on an interval and 
hence (Resumé p. 154) that “the general value of y, the solution of the equation 
dy = f (x)dx, is given by y = ie Ff (x)dx + constant”. This is what he called the 
indefinite integral. 

Cauchy concluded the Résumé with a discussion of differentiation under the inte- 
gral sign. 


Chapter 6 
Cauchy and Complex Functions to 1830 


6.1 Introduction 


Mathematicians in the 1810s did not feel that there was an identifiable single subject, 
albeit perhaps in an imperfect state, that could be called complex function theory. 
Rather, there were the topics involving multiple integrals, functions of several vari- 
ables, mappings from R? to R?, and similar topics, in all of which complex variables 
could be used, if only as a convenient notation for pairs of variables. 

Cauchy’s first involvement in these matters is his Mémoire of 1814 on the evalua- 
tion of double integrals. By the 1820s he had begun to consider that there might be a 
coherent topic involving complex functions of a complex variable (a topic he had not 
discussed in his Cours d’analyse or the Resumé). In the great Mémoire (1825) he pro- 
duced the integral and residue theorems that are today named after him, but even then 
he does not seem to have appreciated the depth of his discovery, perhaps because he 
was still not thinking geometrically about complex variables. A deeper appreciation 
had to wait until the 1840s, by which time other, younger, French mathematicians 
were taking up the subject, so the history of this part of the subject follows a much 
more disorganised course than does the impact of Cauchy’s work on real analysis. 

Yet, as we shall see (especially in Chaps. 11, 16, 17, 19, and 20) by the 1870s 
complex function theory was on its way to becoming one of the dominant fields of 
mathematics, and its origins are the other principal theme of this book. 


6.2 ‘The Passage from the Real to the Imaginary’ 


A curious feature of mathematical analysis in the years around 1800 was the use of 
complex variables to evaluate real definite integrals. The practice had begun with 
Euler and continued with Laplace, Poisson, and Legendre, in his Exercises de calcul 
intégral (1811). In the Mémoire on this topic that he presented in 1814 Cauchy 
commented that many of the integrals had been evaluated for the first time “by 
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means of a kind of induction” based on “the passage from the real to the imaginary” 
(Cauchy 1814, 329), and that no less a figure than Laplace had remarked that the 
method “however carefully employed, leaves something to be desired in the proofs 
of the results”.! Cauchy accordingly set himself the task of finding a “direct and 


rigorous analysis” of this dubious passage. 
Cauchy ‘Misses’ the Cauchy—Riemann Equations 


Cauchy’s Mémoire came in two parts. The first, entitled “The equations that authorise 
the passage from the real to the imaginary’, offered a defence of the method, based 
on apparent validity of the equation 


ALLO — ALO) FI 
Oz ox — 


(6.1) 


According to Cauchy, this equation could be verified “directly by a single differ- 
entiation” (Cauchy 1814, 337), and the whole first part of the memoir proceeds in 
a formal way that makes what later eyes would see as a series of tacit assump- 
tions about properties of the functions in question. He then substituted the equations 
y = M(x, z) + V—IN(a, z) and f(y) = P’ + /—1P" into Eq. (6.1) and obtained 


as 0U aT av 


= = 2 
Oz Ox’ Oz Ox ee) 
where 
OM ON OM ON 
— pra — pra ipl pir 
_ Ox Ox” v Oz Ox’ 
p= plo® 4 pvOM v = pon 4 prOM 
Ox Ox’ iti Oz 


According to Cauchy, nothing more needed to be done except to show how 
Eq. (6.2) can be used, and he filled the rest of this part of the memoir with examples. 
At no stage, even when he set M(x, z) = x and N(x, z) = z, did Cauchy observe 
that from (6.2) one could obtain the equations 


OP’ OP” oP’ ap" 
= (6.3) 


Since these are the Cauchy—Riemann equations, some historical comments are in 
order. The Cauchy—Riemann equations were first written down by d’ Alembert in his 
celebrated essay on hydrodynamics, Essai d’une nouvelle théorie de la résistance 
des fluides (1752, Chap.4, §45). They were created as part of an argument about 
the solutions of flows in the plane (vector fields, as we would say today), and even 
in Euler’s hands they remained part of a body of ideas that allowed the temporary 


'See Cauchy (1814, 329-330). 
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appearance of imaginary quantities on condition that they vanish at the end. They 
had no foundational significance, and Cauchy gave them none either. 

Cauchy’s main aim was the evaluation of improper integrals. When the functions 
S,T,U,V are regular within the rectangle of integration, Cauchy integrated the 
equations to obtain 


Jo" — S)dx = fe" = u'ydz, f cr" —T')\dx= fo —V')dz, 


where the primes indicate the values assumed by the functions at the limits of inte- 
gration. He then applied this to a number of particular cases. 

Memoirs presented to the Institut were reviewed by members of the Institut, and 
in reviewing Cauchy’s Legendre remarked that it “merely conforms to the ordinary 
rules of analysis, which is not a subject of any difficulty”. Siméon Denis Poisson was 
similarly unimpressed. In the long interval that intervened between the submission of 
the Mémoire—18 14—and its eventual resubmission—1825—Cauchy added numer- 
ous footnotes protesting at this judgement, some of which reflect his own deepening 
understanding of what was involved.” But in any case his chief interest as far as this 
paper was concerned was in double integrals. 


6.3 Cauchy, Double Integrals, and Integrals Round 
a Rectangle 


Mathematicians of the period, with their somewhat formal attitude to the processes 
of the calculus, not only tended to regard a double integral as a nested pair of single 
integrals but to be indifferent to the order of integration. This formal approach pre- 
cluded interpreting double integrals in terms of line integrals along the boundaries 
of a rectangle in the plane of complex numbers; they were strictly about integrals on 
a rectangle in the real plane. However, it was known that in some cases this attitude 
was too naive and led to conflicting results, and it was this problem that Cauchy 
addressed in the second part of the Mémoire. Legendre was more impressed by what 
he read here, and indeed Cauchy was much more original. 

Cauchy showed varying the order of integration of a double integral can yield two 
distinct definite values when the function becomes infinite or indeterminate for certain 
values of the variables within the domain of integration. To investigate the matter 
further, he introduced what he called singular integrals (“intégrales singuliéres’’), 
which he defined (Cauchy 1814, 334, see Sect.5.3) as limits as the end points of 
the integral tend to the same fixed value “without the integrals being zero.” Singular 
integrals are a profound idea. They were to crop up repeatedly in real analysis and, 
in an intriguing different way, in Cauchy’s later work on complex integrals. 


2The Mémoire was published in 1827. 
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Cauchy first supposed that the singular value of the integrand was in a corner of 


the rectangle. So, to evaluate the integral as [ C i 8K dxdz, where K = (x, Zz) 
becomes indeterminate for x = a, y = D, he split the integral into two parts: 


a b 


J [ Fose= | [Pace 


where A is the singular integral — i o(a + €,b + Cd, where ¢ is a ‘very small’ 
quantity and € equal to zero after the integration. 


Cauchy took this example 
z 


x2 + 22 


and the integral iC ig Dz K dzdx. He ate ale’ this first with respect to z and then 
with respect to x to obtain the value 7. But, on integrating first with respect to x and 
then with respect to z, he found it was necessary to add the singular integral A given 
by —arctan(e/¢), which for ¢ = 0 equals =". Therefore the value of the integral 
was = + A = —4. Cauchy then considered the general case, when the singular 
point (x, z) is inside the rectangle of integration, before applying his result to the 


evaluation of many improper integrals. 
Cauchy and Contemporary Definitions of the Complex Numbers 


Discussions about the meaning, nature, possibility and even existence of complex 
numbers form an oft-told tale in the history of mathematics. From 1813 Gergonne’s 
Annales carried numerous articles for and against Argand’s geometrical interpreta- 
tion of complex numbers, and Argand himself published definitive papers in 1806 
and 1814, where his ideas were explained in detail along with a new proof of the 
fundamental theorem of algebra.* Oddly, in this debate Cauchy’s name does not 
usually appear. What in fact did he know, or believe, about complex numbers? 

In his late fifties (see Cauchy 1849, 175) he recalled that when he was working as 
a young engineer at Cherbourg between 1810 and 1812 he was told that one Henri— 
Dominique Truel had found a way of representing complex numbers in a plane in 
1786. Some historians, for example Petrova (1974), claim that Cauchy knew nothing 
of the debates around Argand’s work, even though he was back in Paris by then, 
others that he found it insufficiently rigorous, and both interpretations are supported 
by the fact that Cauchy did not mention Argand’s work when he gave a proof of the 
fundamental theorem of algebra (Cauchy 1817a, b), nor did he do so in his Cours 
d’analyse, as we shall shortly discuss. 

It is worth observing that mathematicians in the 1810s regarded mathematical 
operations of all kinds rather formally, and considered square, cube and even nth 
roots as functions, even though they could be many-valued. Cauchy would have 
seen that what is manageable in the real case, say an expression such as V1 — k2x?, 


3See Gilain (1997). 
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becomes much less so in the complex case. It is usually easy to treat two real square 
roots, one positive one negative, separately, but two complex ones are harder to 
control once the variable x is allowed to be complex. With such functions in mind 
it is reasonable to believe that the difficulty is not in defining a complex number but 
in defining functions of a complex variable. Controversies had arisen, for example 
between Lagrange and Poisson, over the value of 


m 


m! 
(2cosx)”" = cos(m — 2k)x , (6.4) 
2, (m — kik! 


when m = 1/3 and x = 7. An application of de Moivre’s formula provides three 
different complex values: 


7130 + iV/3) 91/3 91/3 d— i/3) 
2 , ; 2 ; 


while the power series development of 2” cos” x according to (6.4) gives yet another 
different value: 5 (al ay which is the arithmetic mean of the first and third of the 
previous values. 


Complex functions in Cauchy’s Cours d’analyse 


Cauchy defined imaginary numbers in his Cours d’analyse (1821a, 173-176) much 
as we would define an ordered pair of real numbers, that is as “symbolical expres- 
sions” of the form a + /—1)3, where a and (3 are real quantities. But then, arguing 
that a symbolic expression signifies nothing, he gave over 55 pages to defining the 
arithmetical operations upon them and establishing their properties. This is one of 
the passages in the book where his concern for rigour is most apparent, particularly 
so in his treatment of the multi-valuedness nature of rational powers of imaginary 
numbers. Here his approach greatly surpassed anything else in the contemporary 
mathematical literature. It forms another step in his slow journey from formalism to 
a geometrical understanding of complex function theory, one he may not have fully 
appreciated as he took it. 

Existing controversies surely showed Cauchy the pitfalls in any naive and for- 
mal approach. It was not enough simply to replace real by complex numbers, and 
real by complex variables, in formulae, even when that seemed possible. A proper 
understanding was needed of how to pass in general from a function f(x) to a func- 
tion f(x + iy). Evidently there is no unique answer if the problem is posed at that 
general level: one simply writes f(x + iy) = f(x, y) = f(x) +ig(y) where the 
function g(y) is arbitrary. Yet, in some tantalising way, there is a single ‘correct’ 
answer for most known functions, such as the exponential and trigonometric func- 
tions. Prevailing conventions, he said, ‘are insufficient for fixing in a precise manner 
the meaning of the expressions A*, log x, sin x, cos x, arcsin x, arccos x when the 
variable x becomes imaginary’ (Cauchy 1821a, 240). 

Cauchy proceeded carefully. He defined the concepts of imaginary variable and 
imaginary limit, and introduced a special symbol (()) for denoting any value of 
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a multi-valued function (Cauchy 1821a, 7-12). For certain functions defined by a 
power series he concluded that it is enough to replace the real variable x by the 
complex variable x + iy, but not without a great deal of careful work. He first 
defined imaginary valued functions of a real variable, and extended the theorems 
already proved in the Cours d’analyse to them. He then proved that a geometric 
series with imaginary terms is convergent when |z| < 1, by considering pairs of real 
series, and remarked that its sum is a continuous function of z in the given interval, 
by appealing to his flawed theorem (see Sect. 4.3). Then came the extension of his 
convergence tests and the proof of the binomial theorem to the complex case, at 
which point the multi-valuedness question for functions had to be confronted. He 
answered it by resorting to their expansions in convergent series (1821a, 308-309). 
The definition of the corresponding inverse functions log x, arcsin x, arccos x was 
discussed in the final part of the chapter. 

But Cauchy did not extend his procedure ‘by analogy’ to any real function f (x) 
that was represented by a convergent power series by considering the imaginary 
function f(x + /—1y) as defined by the corresponding imaginary series (provided 
that it was convergent). It remained unclear in his work, and one must suppose to 
Cauchy himself, how to define and study the general imaginary function of an imag- 
inary variable—and it was remain so for some twenty more years. So, for example, 
in his (1823b, 330) Cauchy observed that one seemingly could define an imaginary 
expression by saying that f(x + t,/—1) represents a function ¢(x, f), continuous 
with respect to x and ¢ and satisfying both the condition ¢(x,0) = f(x) and the 
equation oe — @ /=1 . Evidently, such a function ¢ satisfies Laplace’s equation, 
but, said Cauchy, in view of what one knows about Laplace’s equation this merely 
leads to a vicious circle. 

Cauchy repeated the proof of the fundamental theorem of algebra that he had 
given in his (1817b) in Chap. X of his Cours d’analyse, and cited Legendre (1809) 
as a source of his proof. He denoted the square of the modulus of the polynomial 


f (x) = ou, v) + xu, v)V—1 


by F(u, v), and regarded it self-evidently as a continuous function of u and v. There- 
fore, he said, its minimum value is non-negative and is attained a certain number of 
times. Then he showed, by areductio ad absurdum argument, that this minimum must 
be zero, which proves the theorem. He then deduced that a polynomial of degree n 
could be s written as a product of n complex linear factors. 


Cauchy’s ‘Masterwork’ 


By the time he came to write his memoir (Cauchy 1825) that some have hailed as 
his masterwork, Cauchy was prepared to take greater risks, or, perhaps, had acquired 
greater confidence. In his still unpublished 1814 Mémoire he had explained what was 
meant by the notation if : J (x)dx, where xo and X are real bounds and f (x) “a real 
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or imaginary function of the variable x”. Now he was willing to let the variable x be 
complex, writing (1825, 43) that “it is necessary to represent by the notation 


X+Y¥/-1 
f(x)dx 
xotyo/—1 


the limit or one of the limits to which the sum of the products of the form 


>= (xi-1 + yi-1V 1) f @i-1 + vi-1V—D) 


converges’. This adapts his definition of a real integral to the complex setting. When 
x and y are monotonic continuous functions on the interval [fo, T], say x = $(t), y = 
w(t), the integral becomes 


T 
P=AL5s-1 = / (oo 4 V=Iv'o) f (0) 4 V=TH) dt (6.5) 


1 


This is an integral with real bounds, and Cauchy proved, by considering what happens 
when the functions x = ¢(t) and y = (ft) are varied slightly, that the integral (6.5) 
is independent of the nature of these functions, provided f(x + y/—1) remains 
“finite and continuous” within the limits of integration. 

We recognise this result as the ancestor, and perhaps a close ancestor, of the 
Cauchy integral theorem. 

Here we encounter several features of what can be called Cauchy’s mature writing 
style. The concept of a continuous function of an imaginary variable is not to be found 
explicitly defined in this, or any of his previous papers. Moreover, despite the way 
the theorem was stated, Cauchy tacitly assumed that the function f has a continuous 
derivative. This unfortunate habit would have been easy to correct when Cauchy came 
to write this (and other) papers; the fact that he did not suggests that the confusion 
went deeper than poor writing. Finally, the phrase of the “finite and continuous” is not 
adequate to the problem at hand. Here is seems to mean single-valued and infinite, 
but it may be better to interpret as ‘well-behaved’ in a way that Cauchy could feel 
but not yet capture precisely. 

Some historians have suggested that Cauchy’s lack of sensitivity to the meaning 
of the terms involved was simply continuing to follow the old tradition of regarding 
continuous functions as differentiable, because they were de facto given by ana- 
lytical expressions.* And while it may be unwise to attribute to Cauchy, who was 
always intellectually on the move, a steady conviction about anything mathematical, 
this view seems implausible given that Cauchy had been at pains in the Resumé 
to distinguish a function from its Taylor expansion, and to define continuity and 


4See Kline (1972, 637) and Remmert (1991, 196). 
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differentiability in distinct ways. A more plausible alternative is that Cauchy took it 
for granted that a continuous function of a complex variable satisfied the Cauchy— 
Riemann equations. This implies that he was willing to suppose that functions were 
differentiable in the course of a proof without saying so explicitly in the statement 
of a theorem, but such, confusingly, does seem to be the case on several occasions. 

Be that as it may, the deeper question is whether, in 1825, Cauchy was in pos- 
session of a theorem that says a complex integral is path independent in any domain 
where the function does not become infinite. A fair summary of what Cauchy wrote 
(1825, 20-24) would be that if one considers two different paths within the rectangle 
(xo, yo), (X, Y) and the function f(x + y/—1) does not become infinite for values 
x, y lying within the domain enclosed by the paths, then the corresponding values 
of the integral 7 are equal, i.e. the integral J is independent of the path of integra- 
tion. But this could simply mean that the integral is independent of paths within the 
rectangle in the familiar real plane. Even the formalism of path integrals involving 
complex variables can be read as an explanation of how to escape from the troubling 
world of complex variable and complex functions to the safer world of real paths and 
mappings from R? to R*. His theorem need not be about complex integrals at all. 

Cauchy next investigated the case when the function f(x + y./—1) becomes 
infinite at the point where x = a, y = b, corresponding to the value tf = 7. He 
introduced the limit of 


f =lim(x — a+ (y —b)V—D) f (x + yV—1) 


as x tended to a and y tended to b, which he denoted f* = ef (x + y/—1), € being 
an infinitely small number. In his Résumé he had given a theory of the principal value 
of an integral, which he now used to show that the principal value of the integral [ 
was 27,./—1 f*. Using results from his Mémoire of 1814, he deduced that 
* 
i@= a + w(z), 
z—a-—ib 
where w(z) remains finite in a neighbourhood of the origin and so contributes nothing 
to the value of the integral. So, treating the imaginary function f (x+y./—1) as he had 
treated function of two real variables having a point of infinity within the rectangle 
of integration, he obtained the expression {* = e f (x + y/—1) and the value 277i f* 
for the integral. The extension to a finite number of simple infinities was immediate, 
and for higher order infinities, say one of order m, Cauchy used a limiting argument 


to deduce that 
_ f™ YV@tov-1 
49. tad) 


aa 


Here we see, at the same ancestral level, the Cauchy residue theorem. 
Cauchy’s first use for his theorems, in his (1825), was to real (improper) integrals. 
But within a year he began publishing his Exercises de mathématiques, which can be 
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thought of as a personal journal whose issues appeared monthly until 1830. From the 
very first issue onwards, Cauchy developed his new calculus in a more systematic 
way. He called it the “calculus of residues”, he introduced a special symbol for 
indicating “the extraction of the residue” of a function, and he found numerous 
applications for it. What he did not do was to appreciate the fundamental character 
of his integral theorem, which was to become one of the basic theorems of complex 
function theory. He neither used nor quoted from his 1825 paper and his integral 
theorem for two decades—and Cauchy was not at all averse to quoting from his own 
papers. 

Hans Freudenthal found this “utterly strange and hard to explain” (1971, 139), 
and asked: 


Did Cauchy not trust the variational method of proof? Was he bothered by the (unnecessary) 
condition he had imposed on the paths, staying within a rectangle? Did he not notice that 
the statement could be transformed into the one about closed paths that he most needed? Or 
had he simply forgotten about that mémoire détaché? In any case, for more than twenty-five 
years he restricted himself to rectangular paths or circular-annular ones (deriving from the 
rectangular kind by mapping), thus relying on the outdated 1814 mémoire rather than on 
that of 1825. 


One can go further: nor did Cauchy mention the integral theorem in a paper he 
presented to the Academy on August 3, 1846 (Cauchy 1846a) which is commonly 
credited by historians as containing his first general formulation of the integral the- 
orem. 

In this case it seems that historians may have been to eager to read a modern 
interpretation into an old result, for there is no reference in Cauchy’s paper (1846a) 
to either complex functions or complex integration. It concerns the question of when 
the integrals of some differentials are independent of the path of integration, and 
Green’s work on potential theory is clearly in the background. It is likely that 
Cauchy had picked up the news of Green’s work (1828) from the young William 
Thomson (later Lord Kelvin) who visited Paris in spring 1845 and showed a copy to 
Joseph Liouville, and, possibly, to Cauchy, whom he visited several times.° 

Of course, as for instance Remmert (1991, 198-199) has observed, one can imme- 
diately derive the Cauchy integral theorem from his paper (1846a)—but Cauchy did 
not. He wrote only of real variables and real functions, giving no reason to let the 
historian infer that he was writing about complex functions at all. 

On the other hand, Kline (1972, 638) is surely right when he wrote that “Cauchy 
must have thought long and hard to realize that some relations between pairs of 
real functions achieve their simplest form when complex quantities are introduced”. 
In fact, in a steady stream of papers in the autumn of 1846 Cauchy turned to the 
complex interpretation of his theorems. For example, in Cauchy (1846b, 136) he 
extended his theorem of residues to a plane domain bounded by a closed curve. Ina 


5As Kline suggested (1972, 640), and Freudenthal agreed (1971, 140). 


For Thompson’s stay in Paris, and his contacts with Liouville and Cauchy, see Liitzen (1990, 
135-146). 
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paper published a week later he explained the result in a more geometrical way that 
allowed him to speak of a curvilinear integral along a line OP, and he observed that 


when one arrives at the point P the value of this latter integral is generally independent of 
the straight line or curve that one has followed (1846c, 146-147). 


Cauchy also observed that his results assumed that the functions he was dealing 
with were all single-valued, and he made some vague remarks about how many- 
valued functions could be dealt with, but this was to remain a weakness in his work. 


Politics Intervenes 


Not only was Cauchy himself unsure for many years where the deepest significance 
of his results lay, a confused publication regime further complicated the reception of 
his ideas. In July 1830 riots in Paris led to the overthrow of the Bourbon monarchy, 
to which Cauchy felt he had taken an oath of allegiance. 

He left for Turin, where the Bourbon still retained influence and had the support 
of the King of Sardinia. The Jesuits were powerful at court, Cauchy’s interpretation 
of Catholicism was close to theirs, and in January 1832 the King re-opened the chair 
of physics at the University of Turin and appointed Cauchy to it. Cauchy was also 
welcome at the Academy of Sciences in Turin, and presented some important papers 
there that will be discussed below in Chap. 10. Then in July 1833 he left to serve as 
a tutor of the son of the exiled king Charles X in Prague before returning in Paris in 
1838. He lost his teaching position at the Ecole Polytechnique when he left, and he 
was unable to obtain a position for several years after his return because of his refusal 
to swear an oath to the new regime. His absence from Paris undoubtedly diminished 
the impact of his work. 

Bruno Belhoste, whose biography of Cauchy (Belhoste 1991) is well worth read- 
ing, notes that the early 1830s were momentous for French mathematics in other 
ways: Fourier died that year, Galois in 1832, Legendre in 1833. As Belhoste writes: 
“all foretold the end of an era. Cauchy’s absence from Paris was now painfully felt.” 


6.4 Exercises 


1. Grabiner has argued that many of the techniques of e — 6 — N analysis can be 
found in the work of Lagrange and others of Cauchy’s predecessors. In the light 
of this information, what exactly was the originality of Cauchy’s contributions to 
the foundations of analysis in the Cours and the Resumé? 

2. What do you understand by Liitzen’s idea of an architecture of mathematical 
analysis? 

3. Write areview of Cauchy’s mémoire (1825) on integration in the complex domain, 
indicating what he has achieved, what is satisfactory and what is unsatisfactory in 
it, and what could be done next, from the standpoint of one of his contemporaries. 


7See Belhoste (1991, 157). Belhoste documents a few occasions when Cauchy returned to Paris, 
and it does not seem that Cauchy’s departure for Italy was intended to be an absolute break. 


Chapter 7 
Abel 


Niels Henrik Abel (1802-1829) 


7.1 Introduction 


One of the great excitements in the mathematical world of the 1830s and 1840s 
was the discovery by Abel and Jacobi independently of elliptic functions. These 
are complex-valued functions of a complex variable, and as such they were new 
and proved an important stimulus for the growth of a theory of complex functions. 
Equally importantly, they had properties akin to the trigonometric functions—they 
were, however, not singly but doubly periodic, as will be explained below—and this 
made them attractive to study. And, as Legendre had already shown, they were likely 
to have many applications in mechanics. 

In this chapter we look at how their discovery burst onto the French and German 
communities, and then look at the short life of Abel and indicate some of the key 
properties of the functions he discovered. 
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7.2 The Prize of 1830 


Niels Henrik Abel of Christiania in Norway and Carl Gustav Jacob Jacobi, who was 
a Professor at the University of Konigsberg, came independently to the subject of 
Legendre’s elliptic integrals in 1827. They very quickly entered into a rivalry, the 
drama of which was heightened by the rapid stream of their publications. Legendre 
was evidently delighted with the attention his favourite subject was receiving. He 
could see that with Abel and Jacobi taking it up, its fame was assured. In the first 
supplement to his Traité, dated 12 August 1828, Legendre wrote that 


Until then geometers had taken almost no part in this kind of research, but scarcely had my 
work seen the light of day, scarcely could its title have become known to scientists abroad, 
when I learned with as much astonishment as satisfaction that two young geometers, MM. 
Jacobi of Konigsberg and Abel of Christiania had succeeded in their own individual work 
in considerably improving the theory of elliptic functions at its highest points. 


So in 1829, the year he turned 77, Legendre, by then quite ill, worked hard 
to ensure that the Grand Prize of the Academy of Sciences in Paris went to the 
two men who had recently transformed his theory. He made sure that the Paris 
Academy asked him and Poisson to report on it, and Poisson presented their report 
on Jacobi’s work on 21 December 1829.! The report, which was based largely on 
Jacobi’s book, the Fundamenta nova, which had been published in April 1829, and 
it and the excitement surrounding the work of Jacobi and Abel, led to the German 
being elected a corresponding member of the Academy on the 8th of February 1830. 
Then on 28 June 1830 the Academy announced that its Grand Prize in mathematics, 
worth 3,000 francs, would be divided equally between Jacobi and the family of the 
late Niels Henrik Abel of Christiania, for Abel had died on April 6 1829 at the age 
of 26.* 

The theory of elliptic functions, as the new complex functions created by Abel and 
Jacobi are called, grew to be one of the central domains of 19th century mathematics. 
Both men had transformed Legendre’s subject of elliptic integrals in the same ways, 
which added to the sense of competition: they had inverted the integrals by treating 
their upper end points as the independent variable, as will be described below, and let 
the endpoints be complex. The results of these changes were elegant and profound, 
and were to be a major on the later development of complex function theory. Indeed, 
Jacobi was to remark later that the introduction of imaginaries alone was enough to 
solve all the riddles of the early theory (see Dirichlet (1852, 10)). 

Part of the charm of elliptic functions was that they were a rich generalisation of 
the familiar trigonometric functions; they were interconnected by a rich collection 
of identities. And it rapidly transpired that they had many uses: in geometry, in num- 
ber theory, in mechanics, and naturally in astronomy because planets and asteroids 
traverse ellipses. There was therefore was a working theory of approximations to 


' Procés-verbaux des séances de l’Académie 9 (1921), 373; Poisson Rapport (1831). 
2 Among those passed over for the prize on this occasion was Evariste Galois. 
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elliptic integrals in theoretical astronomy, which indeed was the occasion of Gauss’s 
only publication on the subject (1818). 

The idea of inversion was suggested so strongly by the familiar example of the 
trigonometric functions that one can almost wonder how Legendre missed it—the 
answer being, of course, that the integral is sufficiently fascinating. Moreover, it 
arises naturally in the mechanical problems Legendre studied. And no-one would 
ignore the logarithm multi-function just because its inverse, the exponential function, 
has simpler mathematical properties. That said, as is well-known, the integral 


» 4 ds 
u(x) = ——_—— 
0 1—s2 


reduces, on substituting s = sinz and t = arcsin(x), to 


t 
u(x) = ¥ dr =t =arcsin(x). 
0 


In this context, it is natural to invert the answer, and write x = sin(w). This gives 
a well-understood single-valued function instead of a multi-function. We will see 
below why inversion of an elliptic integral suggests that the variables should also 
become complex. That step was taken by Gauss in work he did not publish, Jacobi, 
and Abel. Legendre knew very well that his tables for his real-valued elliptic integrals 
allowed him to infer the values of their inverse functions, but never studied the inverse 
functions directly. In Jacobi’s opinion that 


The fear that the mathematician most chiefly concerned with the determination of numerical 
values had of the imaginary, was the reason that Legendre was prevented from taking the 
most important step in modern analysis, the introduction of doubly periodic functions. 


Abel had also picked up an interesting clue. In 1801 Gauss had published his 
Disquisitiones Arithmeticae, the book that transformed number theory and helped 
make his name. In its seventh chapter he wrote about the algebra of the problem of 
dividing a circle into n equal parts—the topic is called cyclotomy—and in §335 he 
observed of the principles of this theory that “Not only can they be applied to the 
theory of circular functions, but also to many other transcendental functions, e.g. 
those which depend on the integral [ vie .” Gauss never kept his promise to return 
to this topic and give a full explanation, but Abel took the remark as a reason to look 
at that integral in the same way as the integral defining the circular or trigonometric 
functions. 


3 Quoted in Koenigsberger (1904b, 54) and Krazer (1909, 55n). 
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7.3 Abel and his Travels 


Abel was lucky enough to have an inspiring mathematical teacher in his last year at 
school, Bernt Michael Holmboe.* Holmboe lent him his own copies of the classic 
works of Euler on the differential and integral calculus, and when Abel was only 
16 began to give him private lessons in the works of Lagrange. His school report 
on Abel said that the youth was “an excellent mathematical genius”. When he went 
to university, Abel’s talents were recognised by the small group of scholars intent 
on raising the level of science in Norway, among the Christopher Hansteen, whose 
interests were in terrestrial magnetism and who encouraged him to think of a career 
in mathematics. Among Abel’s first ventures into original research was an erroneous 
‘proof’ that a polynomial equation of degree 5 is solvable by radicals. Hansteen knew 
that this was an outstanding problem, and that neither he nor anyone in Christiania 
was competent to pronounce upon Abel’s argument, so he sent him to Copenhagen 
and the mathematician there, Ferdinand Degen. Nor could Degen find the flaw, but 
his sensible advice to test the method on examples soon revealed that there was one.» 

Abel wrote to Holmboe about the trip, and here he also wrote for the first time 
about his work on elliptic integrals. Here too had found something unexpected, and 
this time Degen could not help him. Abel wrote®: 


You remember the little paper which treated the inverse functions of the elliptic transcen- 
dentals, where I proved something impossible; I asked him to read it from one end to the 
other; but he could not discover any false conclusion nor understand where the fault might 
be; God knows how I will get out of it. 


The most likely interpretation of the ‘mistake’ is that it was no mistake at all, but 
a novel property.’ If we briefly call the new function f(x) then, by analogy with the 
trigonometric functions, there ought to be an equation connecting f(x) and f(x/n), 
and there is. However, as an equation for f(x/n) for a given f (x) it is of degree n7, 
and not n, as the analogy would lead one to suspect.® Whatever the explanation, the 
letter is explicit that Abel had already taken one crucial step that Legendre had not, 
and inverted the integrals. 

It was obvious to all who knew him that Abel’s tremendous talent for mathematics 
could not be adequately supported in his native Norway, and with some effort funds 
were found for him to study abroad. He left with three friends on September 7th 
1825 to go to Berlin, partly because Abel’s poor German was better than his French. 
There Abel met the mathematical entrepreneur and Privy Councillor Leopold Crelle. 


4On Abel’s short life see Stubhaug (2000). The earlier Ore (1957) is more reliable on the mathe- 
matics, which is described in greater detail and depth in Houzel (2004). 


5In due course Abel’s proof that the quintic equation is not solvable by radicals was one of the great 
breakthroughs opening the door to Galois’s work. 


6 Quoted in Ore (1957, 65). 
7See e.g. Krazer (1909, 56). 


8The actual number is n” if n is odd and 2n? if n is even. A sign enters the expression for the general 
solution that must be positive when n is odd but may be positive or negative when n is even. 
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As a young man, Crelle had hoped to be a great German mathematician and build up 
German mathematics, but he realised that he lacked the ability and turned to other 
careers. He became an engineer, and constructed the railway from Berlin to nearby 
Potsdam, but he retained his love for mathematics and his belief that the subject 
should not be shackled to applications, which in his view had stunted its growth 
in Germany hitherto. In 1825 he was busy creating a new journal, which would be 
one of the very few anywhere to be exclusively devoted to mathematics, and his 
meeting with Abel could not have been more fortunate for each man. Abel gained 
access to Crelle’s extensive library, and he found in Crelle a supporter who would 
translate his many papers—no less than seven were to appear in the first volume 
of Crelle’s new journal alone. Crelle gained the allegiance of a mathematician who 
would immediately place his new journal, the Journal fiir die reine und angewandte 
Mathematik (or, Journal for pure and applied Mathematics), at the forefront of the 
subject.” 

Abel wrote on a variety of subjects, but often with an eye to the algebraic impli- 
cations. Even before leaving Norway he had recovered from his early mistake, and 
had a paper ready showing that the general polynomial equation of degree five is not 
solvable by radicals.!° In Germany he routinely scrutinised the polynomial equations 
of higher degree that occurred in his work to see if they might be solvable by radicals. 
As noted in Sect. 4.3 above he became aware of Cauchy’s work on real analysis, and 
wrote his own paper (Abel 1826) on the convergence of the binomial series, where 
commented on the lack of continuity of certain Fourier series. 

He did not go to visit Gauss in Géttingen, perhaps because Crelle advised against 
it. Crelle’s opinion of Gauss’s genius was, of course, high, but he tempered it with 
a realistic appraisal of the man’s style, saying that everything Gauss man wrote 
was “an abomination—gruel—so obscure that it is almost impossible to understand 
anything” and Hansteen added that Gauss “is like the fox who covers his tracks in 
the snow with this tail”.!! Instead, Abel went to Paris, where he arrived on July 10, 
1826. This should have been the highlight of his travels, his visit to the centre of 
the mathematical world, but it was not. He found the French mathematicians rather 
unapproachable, and almost comically his best contact turned out to be the young 
Lejeune Dirichlet, who called on Abel under the mistaken impression that he was a 
fellow German. When Abel met Cauchy he found him mad and impossible to deal 
with; Cauchy generally had a habit of talking only about his own work. In a letter to 
Holmboe (24 Oct) he remarked that “He is extremely Catholic and bigoted, which 
is strange for a mathematician.”.!7 Perhaps the biggest set back for Abel was his 


° As commentators noted, despite its title it concentrated on pure mathematics, to the point that 
wits called it the Journal fiir die reine unangewandte Mathematik, the Journal for purely unapplied 
mathematics. It became, and remains, one of the leading journals for mathematics in the world. 


'0He seems not to have known at this stage of the comparable work of Ruffini from 1799. 
11 Both remarks are in Bjerknes (1885, 92). 
See Abel (1902, 4546). 
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realisation that Cauchy was the only person actively engaged in pure mathematics, 
everyone else having been swept up in an enthusiasm for magnetism and other topics 
in physics. He did meet Legendre, who was then 74, but his dismissive comment in 
a letter was that he found him “as old as stone”. 


7.4 Abel on Elliptic Functions 


In December 1826 Abel took up the hint he had found in Gauss’s Disquisitiones 
arithmeticae. As we saw above, the arc length of the lemniscate is given by the 
simplest of elliptic integrals: 


; dt 
0 


He was able to show, as he wrote to Crelle, that indeed the lemniscatic arc can be 
divided by ruler and compass into n equal arcs for precisely the same values of n as 
the circular arc, exactly as Gauss had hinted, and to Holmboe he said that he now 
saw as clear as day how Gauss had come to discover his results.!* But Abel was not 
happy in Paris, and in January 1827 he was back in Berlin. 

1827 was the year his great work on elliptic functions began, and his first major 
paper on elliptic functions, the ‘Recherches sur les Fonctions elliptiques’, appeared 
in Crelle’s Journal. It is the first published account of the theory of elliptic functions. 
They are presented, albeit formally, as complex functions of a complex variable, but 
also in close analogy with the trigonometric functions. He introduced three functions 
(the analogues of the sine and cosine functions), and derived addition and division 
laws for them. He ended the paper with expansions of the new functions in infinite 
series and as infinite products. 

Abel wrote the general elliptic integral of the first kind not quite as Legendre had 
done but in the slightly different form: 


a” a 
= 7 
° eecarers — 


Consequently, the inverse function x = ¢(q) therefore satisfies 


W220 hee. 
da 


'3See Abel (1881, 2, 261). The circumference of the lemniscate is divisible into m equal parts by 
ruler and compass alone (italics Abel’s) if m is of the form 2” or a prime of the form 2” + 1, or if 
m is a product of numbers of these two kinds. See Houzel (2004). 


74 Abel on Elliptic Functions 75 


Abel then defined the functions 
f(@~:=V1-c¢(a), and F(a) :=V1+e7¢(a). (7.3) 


He noted that each of the equations ¢(a) = 0, f(a) = 0, F(a) = 0 have infinitely 
many roots which can be determined, and for any integer m, @(ma) can easily be 
found in terms of @(a), etc.!4 

The division problem is, however, harder, and he showed that the equation for 
o(a), f(a), or F(a) in terms of d(nq) is of degree n?. The analogy with the trigono- 
metric functions is evident, but the degree of the equation may be unexpected, and 
the memoir is focussed on understanding this equation. 


Abel defined 
l/c 
dt 


WwW 
2~ / Jd — c212)(1 + e212) 


(7.4) 


The function ¢ is positive in the range 0 < a < 4, (0) = O and OS) = 7 

Moreover, ¢ is an odd function, ¢(—a) = —¢(a). To explain the degree being n’, 

Abel introduced complex variables. He replaced a formally by iG, so ix = $(if), 
x 


and noted that @ = { ———“____ 
0 a/ (1+ce21t7)(1—e217) 


x = 1/e. He inverted this integral and defined 


is real and positive between x = O and 


I/ie 


w | dt 
20 J Vd +22) — €212)’ 


0 


(7.5) 


where x is positive in the range 0 < @ < - Therefore ¢(ia) = id(a), F(ia) = 
f(a), and o(4) = §. 

The functions ¢, f, F are now known along the real and imaginary axes. To find 
their general values Abel established these addition formulae: 


b(a) f(D) F (B) + O(B) f(a) F(@) 


d(a+ f) = . 
=e 
Fe ea HMA FOO, 
2 
F(a+ 2) = F(a)F(6) +e POP) IO FB) 


R 


where R = 1 + c*e?d?(a)¢?({). 


'4Compare the equation for sin(nx) given sin(x). 
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He argued that if r denote the right-hand side of the first equation then the expres- 
sion for gr is symmetric in @ and (3, and so dr = ar from which it follows that r is 
a function of a + 3 that can be found by setting G = 0. This implies that r(a + /3) 
= d(a + f). 

Abel now deduced a slew of other formulae then follow from these addition for- 
mulae, formulae for ¢(a + 3) + ¢(a — 3), for f(a + 8) + f(a — 8) for d(a+ $F), 
and so forth. From 

$a t+ $) = o(—a + $) and g(a + F) = o(-a + ¥) 
he deduced that 


Ww ia. w io _ (2 iw, | 16 
ere ee ers le Cae eal (7.6) 


which in due course became the recognition that these functions have poles at 3 + ip 
The formulae at once imply the functions are periodic: 


o(a+w) = —o(a), (a+ 2w) = g(a), (7.7) 
ba+iv) = —d(a), d(a + 21%) = d(a), (7.8) 


(the formulae for f and F are similar), and Abel observed (§5) 


The formulae that have been established make clear that one will have all the values of the 
functions ¢(@), f(a), F(q) for all real or imaginary values of the variable if one knows them 
for the real values of this quantity lying between $ and S* and for the imaginary values of 


the form i3, where (3 lies between 5 and —$. 


The quantities 2w and 2iw are periods for the new functions, which are therefore 
said to be doubly periodic. The zeros of @(a +i), for example, can be found by 
looking in the rectangle =" < a < $, =*% < ( < § and appealing to the double 
periodicity. By the addition formula, the equation d(a +i) = O reduces to two 
equations on separating out real and imaginary parts: 


Pa) fUAFUS)=0, oa) f(B)F(A) = 0. 


These equations in turn yield a = mw, 3 = nw. The similar equations for f and 
F imply f ()F G8) = 0 a = (m+ 4)w, b = (n+ 5), and yield the values for 
which ¢ is infinite. So the zeros of f(x) are x = mw + niwi. Abel likewise solved 
the equation ¢(x) = 5 and found x = (m + 5w) + (n+ 5id), and the equation 
(x) = (a) in terms of a and found x = (—1)"*"a + mw 4+ nid. 

Thus far, the analogy with the trigonometric functions had guided Abel’s work 
rather smoothly. He was also able quite easily to express ((mq) as a rational function 
of d(a), f (a), and F(a). Heset (na) = £» and found recurrence relations for P,, and 
Q», and dealt similarly with f (na), and F (na). However, the converse question— 
express, say, 6(a/m) as a function of (a), f(a), and F(a@)—proved to be harder; 


'5 Abel also observed that these results followed from those in Legendre’s Exercises. 
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what Abel did here was important in the later work on the theory of equations, and 
influenced the future development of Galois theory (see Houzel 2004). 
As is well known, the equation for sin(u/3) given sin u is the cubic equation 


4x3 — 3x + sinu = 0, 


because sin u = 3 sin(u/3) — 4 sin? (u/3), and clearly the angles (u/3), (u + 27)/3 
and (u + 47) /3 are such that their sines satisfy this equation. However, in the elliptic 
case, the division equation that gives @(u/3) in terms of @(u) is of degree 9. This 
is because some of the roots in the elliptic case are complex, whereas as in the 
trigonometric case they are real. Once the elliptic function is treated as a function of 
a complex variable, which displays its double periodicity, the values of u that satisfy 
the equation for ¢(u/3) will all clearly be of the form ug + (mw + mw)/3, and so 
there are 9 of them. This makes it clear why the number of solutions is unexpectedly 
large. 

Abel’s ideas about complex variables were entirely formal, but that said he pre- 
sented a fully developed theory of elliptic functions as complex functions, analogous 
to the trigonometric functions. We shall now see that Jacobi’s way in to the subject 
was significantly different, and closer to Legendre’s. 


Chapter 8 
Jacobi 


Carl Gustav Jacob Jacobi (1804-1851) 


8.1 Introduction 


In this chapter we look at the life of Jacobi, and indicate some of the key properties 
of elliptic functions that he discovered by taking a different route to Abel. Then we 
look at how he published his work, the correspondence with Legendre, and his major 
book the Fundamenta nova of 1829. As we shall see, Jacobi also found important 
and entirely new applications of elliptic function theory to number theory and to 
geometry. 

Carl Gustav Jacob Jacobi was born into a banker’s family in Potsdam in 1804. 
He went to the University of Berlin at the age of 16, by which time he had already 
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been reading the works of Euler, Lagrange, and Laplace, and in 1825, at the age 
of 20, he submitted his Ph.D. and received permission to begin his Habilitation, the 
then-necessary and sufficient qualification for teaching in any German university (it 
was not a sufficient condition for being paid). In May 1826 Jacobi transferred to 
the University at KOnigsberg, where he became an associate professor at the end of 
1827. The University had a small but significant science faculty: Franz Neumann and 
Heinrich Dove taught physics, Gauss’s friend the astronomer Bessel was also there, 
and these high-powered colleagues sparked a lifelong interest in applied mathemat- 
ics. Ironically, the topic that made his name internationally and which interests us 
most, elliptic functions, was the only one where success did not come easily to him. 
Dirichlet, his close friend in later life, told this story in his obituary of Jacobi: 
One of his friends who noticed him in a bad mood one day, received this answer when he 
asked why he was out of sorts: You see me on the point of returning this book (Legendre’s 
Exercises) to the library, with which I’ve been exceedingly unlucky. On other occasions 
when I have studied an important work this has stimulated me to some thoughts of my own 
and there has always been something in it for me. This time I have come away quite empty 
handed and have not been inspired in the least (Jacobi, Werke, I, 7). 


Later, but richer, was Dirichlet’s comment. 


8.2 Jacobi and the Transformations of Elliptic Integrals 


Jacobi took a different route from Abel’s into the subject: the transformation of 
one elliptic integral into another by rational changes of the variable.' In early 1827 
he had not seen Legendre’s Traité and he did not know that Legendre had found 
a transformation of order 3, but by June of that year he had found become the 
first to discover the existence of transformations of every degree. His two notes on 
his discoveries appeared in the September issue of the Astronomische Nachrichten, 
edited by another of Gauss’s friends, Heinrich Christian Schumacher. 
In the first note Jacobi considered elliptic integral 


0) wv 
dt 
/——= 2 cine =m | —_— 1 k2 2 
i v (1 — c* sin“ ft) ‘ Vv (1 — k* sin‘ fr) 


as a function of ¢, and sought to transform it into another involving a new variable 
w. The transformation will produce values of the new modulus, k, and the number 
m. Jacobi claimed there are nth order transformations for every integer n that can 
be found by writing sin ¢ as a rational function of sin 7: say, sind = v where U 
is a polynomial in odd powers of sin 7 up to the nth, and V is a polynomial in even 
powers of sin 7 up to the (n — 1)th. However, at this stage he could only prove his 
claim when n = 3 or 5; the rest of his claim was strictly speaking only a conjecture. 


'See Cogliati (2014). 
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He also pointed out that the composition of two transformations yields solutions 
to the multiplication problem of order n relating two elliptic integrals 


} 


0) Y 
dt dt 
Ji=26n5 | -/i—=caey. 
: (1 — c* sin“ t) (1 — c* sin“ ft) 


In his second note Jacobi he showed how his new transformations, like Legendre’s, 
could assist in calculating tables of values for the integrals. 

There has always been a question about how Jacobi came to the idea of inverting 
the elliptic integrals. He took this step in the December edition of Schumacher’s 
journal, when he proved his general transformation formula.” He argued that the 


effect of the substitution y = & is to transform 


dy _ dy 
Jd —ayd—-ayd—ayyd—ay) JS fa) 


into 
dy _ dy 
M/C —byd—biyd—hy1—by) = JS(ga) 
where, if 
U 
T2 = v4 fay) 
g4(x) 


then i =V ae U A and M is to be a constant. 
To carry out the transformation of 


dy 
V(d = y2) — A2y?)) 


into 
dx 


MJ/(A — x2) — k2x2)) | 


0) 
he denoted the integral [ 


dt 
0 a/ (1—k?2 sint) 


n/2 


by F(@) and defined 


dt 


Jd —k2 sin? 1) 


2See Jacobi (1827). 
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He now remarked that if F(¢) + F(w) = F(c) then o is the expression given by 
Legendre (see above, (3.6)) and writing F(¢) = &, Jacobi said 


x He 
F — 
7 | V(d = #)0 = #1) 


and, inverting the elliptic integral, 
x = sin(am(&)). 


He now reinterpreted formulae such as the one for o in terms of ¢ and w so that 
they apply to x = sin(am(€)), and found formulae first for V in terms of x, then for 
U in terms of x, and finally for M. The coefficients of U and V and the expression 
for M are full of terms like sinam hy and sinam aa , SO it is quite evident that 
Jacobi could never have proved his general claim with inverting the elliptic integral. 


When, then, did he discover it? 


The Legendre—Jacobi Correspondence 


For this we can turn to the famous correspondence between Legendre and Jacobi, 
which began when Jacobi wrote to Legendre on 5th August 1827 to inform him of the 
discoveries that Schumacher was shortly to publish.* He described his work on the 
transformation of elliptic integrals, mentioned in passing that Gauss had discovered 
a transformation of order 7, and ended with some remarks about number theory that 
were to prove significant later. 

Legendre replied on 30 November 1827. He observed that he had already found 
the transformation of order 3, and welcomed the one of order 5, but confessed that he 
could not find polynomials U and V for the 7th order transformation and doubted that 
they existed. He therefore urged Jacobi to find and publish a proof. But unwittingly 
Jacobi’s favourable reference to Gauss upset the older man. Legendre launched a 
string of complaints: Gauss had claimed the law of quadratic reciprocity as his when 
he, Legendre, had published it in 1785 (in fact, Legendre’s statement was correct 
but Gauss’s proof was the first correct proof). Gauss had tried to claim priority over 
Legendre for the method of least squares (again, this was correct but Gauss had 
not published his method, only put it to use). Now, apparently, Gauss was claiming 
to have discovered one of Jacobi’s ideas in 1808. “Such excessive impudence’”’, 
Legendre wrote, “was unbelievable in a man of Gauss’s abilities”, and Jacobi should 
not worry about it. 

Legendre then took the step of communicating Jacobi’s results to the Académie 
des Sciences in the warmest terms on 5 November 1827. His report was published 
in the Globe on the 29th of November, which delighted Jacobi. He next wrote to 


3See Legendre and Jacobi (1875). 
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Legendre on 12 January to inform him of the content of Abel’s ‘Recherches’, the first 
part of which, with the inversion of the elliptic integrals, was now in print, and drew 
attention to the striking division of the lemniscate. He then informed Legendre that 
that the next edition of Schumacher’s journal would carry his proof of the existence 
of the general transformation. Legendre replied on 9 February 1828 to say that he 
had already seen Abel’s paper, although he was happy to have had it analysed in 
Jacobi’s language, which was closer to his own, and he went on: 


It is a great satisfaction to me to see two young mathematicians, like you and him, cultivate 
successfully a branch of analysis which has for so long been my favourite object of study 
and which has not attracted the attention in my country that it deserves. 


In his reply to this letter (12 April 1828) Jacobi described his route to his original 
discovery, making it clear that he had had the idea of inverting the integrals before 
he wrote his second letter to Schumacher, in August 1827. 

Does this establish that Jacobi came to the idea of inverting the elliptic integral a 
month before he could have seen Abel’s paper? Krazer was willing to accept Jacobi’s 
testimony, but Carl Anton Bjerknes was the first to raise the idea that Abel’s work 
might have provided the necessary inspiration for Jacobi, which he later covered 
up. This in turn might be a Norwegian reaction to the way, at the time, that history 
seemed to have favoured Jacobi’s approach over Abel’s, a matter naturally influenced 
by the fact that Abel had died so young. But it was not the view of Sophus Lie, who 
was working on the second edition of Abel’s collected works at the time, and was 
the cause of some friction between him and Bjerknes. The difficulty is that Jacobi 
himself admitted that his published proofs were different from his discovery method, 
and based his first study of the transformations on the inverted integrals. Moreover, 
Ore has argued convincingly (1957, 180-190) that Jacobi could well have seen Abel’s 
‘Recherches’ a little earlier when, under pressure from Legendre, to provide a proof, 
and with his application for a Professorship at K6nigsberg in the air, he suddenly 
found that a rival had transformed the whole subject. But even if the conclusion were 
to be that human weakness guided Jacobi’s hand, and the idea of inversion “in all 
probability was revealed to him through reading Abel’s ‘Recherches”’ (Ore 1957, 
184), what he did with the idea was still to be remarkable. 


8.3 Abel and Jacobi as Rivals 


Abel was certainly alarmed when he first heard of Jacobi. As Hansteen described to 
Schumacher in a letter that Schumacher quoted to Gauss (quoted in Ore 1957, 189): 


Abel sends herewith an article about elliptic transcendents, which he asked to have printed 
as soon as possible, since Jacobi is on his heels. The other day, when I handed him the last 
number of the Astronomische Nachrichten, he became quite pale and was compelled to run to 
the confectioner’s shop and take a schnapps of bitter to counteract his alteration. For several 
years he has been in possession of a general method which he communicates in this paper, 
and which includes more than Jacobi’s theorems. 
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His countermove was to send to Schumacher’s journal an article on the transforma- 
tion of elliptic functions but one written from a vastly more novel standpoint: “My 
Knockout of Jacobi” he called it in a letter to Holmboe. Off stage one participant 
was not impressed: Gauss replied to Schumacher (23 May 1828), who told Crelle, 
who in turn told Abel, that “he (Abel) has come about one third of the way that I 
have gone in my researches, with the same aim and even with the same choice of 
notation.” True to form, Gauss did nothing to encourage either man. 

Jacobi’s response to the ‘knockout’ was generous. He wrote to Legendre on 9 
September 1828 to say that. “It is as far above my praise as it is above my own 
work”, and that he valued it particularly for supplying the rigorous proofs his own 
work lacked. But he too switched journals to acknowledge the presence of the other 
man, and pointed out in a brief note in Crelle’s Journal 3, 1828, that he knew there 
were p+ | transformations of order a prime p and gave them explicitly as the result 
of substituting g? or any of the p distinct values of q!/? into 


2g'/4 4299/4 4 2925/4 4... 
Vk = 4 9 
1+2q +2q*+2q’+-:: 


The Jacobi-Legendre correspondence also shows that Jacobi immediately began 
to use Abel’s ideas in his own study of transformations. He even admitted to Legendre 
(12 April 1828) that he could not do without Abel’s analysis. It is in his letters that 
several of the striking infinite series that Jacobi was so adept at handling appear 
for the first time, and which will entertain us below. In Jacobi’s opinion the most 
remarkable of these was 


2K 
(FP) = 1420+ 2g4 424° +29" 


where K is a complete integral and g = exp (=# ) , but he also found this striking 
identity concerning eighth powers: 


8 8 
(= gd - 43) = 49)...) + 16g (+40 +44) +4) --.) 


8 
=(d+pa+q0+¢)...) . 


Jacobi’s first proofs of these identities depended on the expression for sin amu as 
a quotient of infinite products, which at that time had only been proved by Abel, and 
Legendre expressed himself pained by this admission: 


which testifies to your candour, a quality that accords so well with real talent (...but) having 
done justice to the beautiful work of M. Abel I place it far below your own discoveries and 
would like the glory of them, that is to say of their proofs, to belong entirely to you. 
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Jacobi later found independent proofs of the crucial theorems, as he told Legendre 
on 16 June 1828. 

By now Crelle was helping to get Abel a job somewhere in Europe, and Jacobi 
wrote to Crelle full of praise for Abel’s work. It was known that Abel had contracted 
tuberculosis, and in September 1828 Legendre, Poisson, Lacroix, and Baron Maurice 
(a Swiss mathematician and member of the Paris Academy of Sciences) wrote from 
the Institut de France to the Swedish King Karl Johan urging him to find a place for 
Abel in Stockholm. In October Legendre finally wrote directly to Abel, who replied 
(25 November 1828) that 


Jacobi will certainly perfect to an undreamed-of degree the theory of elliptic functions but 
even mathematics in general. No-one can esteem him more highly than I do. 


Abel was by then back in Norway, where his health collapsed towards the end of 
February, and he died on the 6th of April, 1829, at the age of only 26. 

It was inevitable that Abel’s reputation would decline for a while after his death 
as Jacobi continued to develop the subject of elliptic functions his way. The paper 
that eventually restored Abel’s reputation was a short note in Crelle’s Journal in 


1828, in which he discussed integrals of the form [ ae, where r(x) is an arbitrary 


rational function and R(x) is an arbitrary polynomial. This was but a fragment of a 
much longer paper he had submitted to the Académie des Sciences in Paris in 1826 
dealing magnificently with the general case of arbitrary integrands, but at the time 
of his death the paper was sitting as good as forgotten in a pile of papers in the care 
of Cauchy. 


8.4 Jacobi’s Fundamenta Nova 


In 1829 Jacobi published the book that was to become the definitive account of the 
theory of elliptic functions for at least a generation, his Fundamenta nova theoriae 
functionum ellipticarum (or, New foundations of a theory of elliptic functions). It 
opens with the transformation problem for elliptic integrals. He wrote: “The problem 
we shall propose is generally this: to find a rational function y of x such that” 


dy _ dx 
JA+ By +Cy2+ Dy3+Eyt VJA+Bx+Cx?+Dx3 + Ext 


He then went on to the related multiplication problem, which is to find solutions to 


dy = Mdx 
VA+ By +Cy?+ Dy3+ Ey4 JVA+Bx+Cx? + Dx3 + Ex4 


where the quantity M, which may depend on the parameters A, B,..., E, does not 
depend on x. 
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He gave the example of the transformation of order 3: 
U=x(ata'x’), V=14+d'x’, 
In this case the moduli ) and k are related by this equation, where u* = k and v+ = ): 
us —v* + 2uv(1—u?v") =0 


and the multiplier is M = fae” He also showed that the transformation of order 5 
produces this relation between u and v: 


u® — v® + 5u7v*(u? — v*) = 0, 


where M = wwe), Equations like these between two moduli he called modular 
equations (§24); they later caught the attention of Galois, because the cases of low 
degree can be remarkably simplified. 

Jacobi sketched an argument to show that if one substitutes y = v. where U 
and V are polynomials in x of degrees p and m respectively, and p > m, then the 
integrand in y reduces to another of the same form, possibly multiplied by a constant, 
provided that the coefficients of U and V can be suitably chosen. When this is the 
case, the transformation is said to be of order p, and Jacobi showed by a counting 
argument that this can be done for any value of p. More specifically, he showed that 
there are more coefficients in the polynomials U and V than the number of equations 
they have to satisfy—a typically 19th Century approach. In the book, however, Jacobi 


solved these problems by inverting the elliptic integral 


y 


4 & 
dt -/ ds 
V1—k2sin?t J J —s2)(1 — k2s2) 


u= 


0 


(where sin y = x) to obtain elliptic functions, solving the problems there, and then 
transferring them back to the elliptic integral s. Jacobi showed that transformations 
of every order exist, and investigated the number of each order. He studied the 
effect of a transformation on the moduli, and showed how the composition of two 
transformations can be made to yield a solution to the multiplication problem. 

To study the elliptic functions, he followed Legendre in calling @ an angle and 
regarding it as the amplitude of the function u. He wrote @ = am(u),x = sin(am(u)), 
and called x an elliptic function. He defined the complete elliptic integrals 


n/2 


/ dt 
k= | —___ 
1 —k? sin? t 
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and 
n/2 


Roy dt 
) 1— Kk? sin? t 


where k and £ are complementary moduli, that is, k? +k’ 1, Elliptic functions with 
a purely imaginary argument were defined by introducing a new variable defined by 
the transformation sin ¢ = i. tan w, cos @ = sec w. He then used an addition theorem 
to define elliptic functions for arbitrary complex variables, and finally deduced that 
the new functions were doubly periodic, with periods 4K, which is real, and 4i K’, 
which is purely imaginary, at least when the modulus k is real. Both are imaginary 
when the modulus itself is imaginary. 

Jacobi showed, as Legendre had earlier, that the complete elliptic integrals satisfy 
the differential equation 


dQ 
dk 


+ (1 — 3k?) kQ=0 
and he used this differential equation to the study properties of the multiplier M. 

Jacobi devoted the second half of the book to the study of the new elliptic functions. 
He showed how to write them as quotients of one infinite product by another, and 
how investigated how the complete elliptic integrals depend on the modulus. This 
led him to such striking formulae as (§36, nr 7): 


+ qd+q* + -) 


i va ( d+qd+q)d +45) 


where g = exp(—3&), 

In §52 Jacobi introduced a quantity that was to play a more important, foundational 
role in later versions of the theory: the theta function, here defined by the formula 
(p. 198) 

u 


O(u) = 0(0) exp [zat 


0 


where Z was defined in terms of the complete and incomplete elliptic integrals of 
the first and second kinds by the formula 


_ F'E(¢) — E'F(@) 


Z(u) Fi 


In what follows we stick to his later definition, as he did, which is related to this one 
but so obscurely that I omit it.4 


4See Whittaker and Watson (1927, 479). 
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He investigated his theta function as a function of a complex variable and showed 
that it was periodic but not doubly periodic. It satisfies 


6(u + 2K) = O(u), and 0(u + 21K’) = —exp (“> ~) -O(u). 


Jacobi also noted that the theta function satisfies the heat equation. 

The book is full of power series in terms of the variable g and identities involving 
the theta function and the elliptic functions. These included some that he had already 
shown to Legendre, such as 


2K 
(=) = bog 49g" 2g" ag a2: 
where K is a complete integral, and the striking identity concerning eighth powers: 
3 5 2 4 6 : 
(a-gd-4)d-4*)...) +16(g +90 +40 +4...) 
3 5 8 
=((+p0+@)0+4°)...) . 
He also found series for 


. 2Kx 2Kx 
sinam (=) and 0(0)0 ( ) ; 
T T 


and came close to writing the function sin am as a quotient of two theta functions. 


8.5 Elliptic Functions and Complex Functions 


Can the first theories of elliptic functions be regarded as theories of functions of a 
complex variable? There is no question that they promoted the development of that 
theory, as this book will show, but that cannot obscure the fact that the theories were 
entirely formal. Abel and Jacobi did not discuss what they took a complex variable 
to be—or even a complex number—and surely thought of a complex variable as 
an expression of the form x + iy, to which the usual rules of arithmetic apply and 


i? = —1. The difficulty comes with defining a complex function, and so justifying 


u 
such arguments as this: [ 4 = v because the substitution s = it implies that 
) vI- 


iu 
Fi aS = v, and so u(v) = iu(iv). If the addition theorem is proved only for real 


variables, or perhaps also for purely imaginary ones if one accepts the validity of the 
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substitution just described, is it then proved for all imaginary variables? What, if so, 
is the corresponding theory of complex integrals? 

Abel and Jacobi would very likely have appealed to the common-sense meaning 
of the formulae that resulted. They might have argued that the fact that the integral 
is not well defined once the variables have become complex but is an infinitely 
many-valued expression is somewhat beside the point. They could have pointed to 
log x, which loses its unique meaning once x can be complex, and argued that log x 
always arises in a context that will make clear what it means. Nonetheless, they might 

x 
have said, just as mathematical common-sense enables one to interpret a and to 
I 
know that dex = 1. so too does simple formal algebra enable you to work with 
(many-valued) elliptic integrals and complex-valued elliptic functions. 

Moreover, the early developments of complex function theory, then almost exclu- 
sively in the hands of Cauchy, were confused. Doubly confused, in fact: mathemati- 
cally confused, in that Cauchy’s own ideas were in a turbulent state of development, 
and confused as regards their dissemination. This may be why, in the first flood of 
invention, neither Abel nor Jacobi mentioned Cauchy’s work, and nor did Legendre 
suggest that they might consult it. In fact, it is clear that in the first phase of the 
creation of a theory of elliptic functions there was no theory of complex integrals. 
Furthermore, the fact that Cauchy’s theory of complex integrals made no mention of 
multi-valued integrands was to drive people, including Jacobi in the 1830s, to seek 
other foundations for elliptic functions altogether, and to abandon the starting point 
of elliptic integrals. 

Other features of what was to become complex function theory are also lacking in 
the elliptic function theory of the 1820s. Legendre’s classification of elliptic integrals 
into three kinds, and Abel’s later study of arbitrary algebraic integrals, remained 
purely algebraic, lacking the idea that the integrands either have no poles, or simple 
poles, or poles of higher orders. Similarly, the elliptic functions become infinite, but 
there is no understanding of poles. We can conclude that Abel’s and Jacobi’s theories 
of elliptic functions were purely formal. 

But that does not mean the theories were not novel and exciting; plainly they were. 
The new functions were an extension of the trigonometric functions and an addition to 
the short list of functions of a complex variable that were known at the time. Learning 
to use these functions was a task mathematicians found intrinsically attractive as well 
as useful in diverse fields. However formal, they were a theory of complex integrals 
and intrinsically complex valued functions of a complex variable, and they were to 
suggest to later generations of mathematicians that they way to handle them would 
be to incorporate them into a general theory of complex functions. 
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8.6 Two Applications 


An Application to Number Theory 


Fermat had claimed, without proof, that every positive integer is the sum of at most 
four squares. In a letter to Goldbach in 1749 Euler suggested that the most natural 
way to prove Fermat’s theorem would be to use power series; an argument Lagrange 
was able to conclude in 1770.> Jacobi, without knowing of Euler’s letter, first stated 
his result about the number of distinct ways a number is a sum of four squares in a 
letter to Legendre on 9 September 1828. He announced it publicly in Crelle’s Journal 
(vol. 3, 1828) and gave his proof on the last page of his Fundamenta Nova in 1829. 

Jacobi found that the coefficients 7, in one of his power series were essentially the 
number of ways n could be expressed as a sum of four squares. From his knowledge 
of the function that this power series represented he deduced this identity for ry: 


ins Do aeal ¢ 


In words, ry is eight times the sum of the divisors of n that are not multiples of 4.° 
Jacobi’s observation to Legendre concerned the identity 


ae 84 16g? 24q3 

=~) =1 

(=) "Tog 1-@  1-@ 
8q 8q 

=1+ + + ee 
(=¢? (lg)? Ug? 

=1+8 5° 6(p) (4? +397? + 344? + 3q*” +...) 


where the sum is taken over all odd numbers p and ¢(:) is the sum of the factors of 
p. The series on the right hand side contains every power of g, and by comparison 
2K 
qT 
four squares. In the Fundamenta Nova Jacobi went further to deduce that the number 
of ways a number is the sum of four squares is 8 times the sum of its divisors that 
do not divide 4. 

To explain Jacobi’s results, we need to look a little more closely at how he obtained 
them.’ Jacobi worked with a power series (later) denoted 


with the fourth power of the series for one finds that every number is the sum of 


lo.) 
63(0, q) := 1+ Si". 


n=1 


5See Fuss (1845, vol. 1, 522-524), letter OO 861, also Weil (1984, 179). 

Note that on this count d? and (—d)? are different. 

7See the very helpful exposition by Eric Conrad: Jacobi’s Four Square Theorem, at http://www. 
math.ohio-state.edu/~econrad/Jacobi/sumofsq/sumofsq.html. 
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If you regard this as a series 1+ >* : ajqi you see that the coefficient a; is the number 
of ways the integer j can be expressed as a sum of one square (i.e., a single square), 
where we count n” and (—n)? as distinct. Therefore the coefficient of j in the power 
series expansion of 63(0,q)” = 1+ Zi ajq! represents the number of ways j can 
be written as a sum of m squares, where we count the order of the summands.® 

If you write down the first few terms of 03(0, q)* you get 1+ 8q + 24q* + 32q? + 
24q* + 48q> + 964° + 64g! + 24q® + 104g? + 144g!° + --- . This already makes 
it likely that every number can be expressed as a sum of four squares (Lagrange’s 
theorem), but it doesn’t prove it—it is possible that the coefficient of some high 
power of q might vanish—nor does it enable us to write down the number of ways 
any given number j can be expressed as a sum of squares: we need a formula for the 
coefficient of g/. But Jacobi knew that the theta function was (singly) periodic, so he 
did exactly what Fourier had shown was the natural thing to do: express a periodic 
function as a Fourier series. He scaled the variable so that the elliptic function went 
from having period of 4K to a period of 27, by using the variable ok ~ and obtained 
these Fourier series: 


2K (2Kx AS g@tne 
sn( ; ) — sin(2j + 1)x. 


— q2jtl 
ko = git) 


2K Kx oe g@2itb/2 ; 
= an (= : b) =" cee qty cos(2j + 1)x. 
re 
2K 2K 
Zan (= a w=1442 Ta aaa 


Knowledge about elliptic functions therefore provides knowledge about these 
power series. For example, the series for the function dn, with z = 0 is 


2K qi 
Sees ed 
1 Digg 


and the series for its square is 


2K 
(*) - Ea eee: Die 


2K 
63(0, q) = (=). 


8You can check that in this way the number 5 = 1? + 2? can be represented in 8 different ways: 
switch the order of 1 and 2 , replace | with —1, or 2 with —2. 


And, splendid to say, 
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2K 


Jacobi expanded the series for = as a power series and found that it came out as 


1+4 >.) -— a))a!, 


j=l 


where dj(j) is the number of divisors of j that are congruent to | mod 4 and d3(/) 
is the number of divisors of j that are congruent to 3 mod 4. This gave him his Two 
Squares theorem: The number of ways a positive integer can be a sum of two squares 
is equal to four times the difference of the numbers of divisors congruent to 1 and 3 
modulo 4. 


When Jacobi expanded the series for (24 


2 . 
=*) he found that it came out as 


1+8 > (i). 


k| j, Atk 


where the sum is taken over all divisors of j that are not divisible by 4. This gives 
Jacobi’s four squares theorem: The number of ways an integer can be written as the 
sum of four squares is equal to eight times the sum of all its divisors which are not 
divisible by 4. Lagrange’s theorem follows by observing that | divides every integer 
and | is not divisible by 4. So every number has at least 8 representations as a sum 
of four squares. 


An Application to Geometry 


Jacobi also boosted the new theory of elliptic functions with an attractive proof 
of a striking geometrical theorem called the Poncelet closure theorem. Jean Victor 
Poncelet had given his own difficult and delicate proof in his Traité des propriétés 
projectives des figures (1822); Jacobi’s was much more direct.” 

Take any two conics C and D with C inside D, say, and suppose for simplicity that 
both of them are circles, centres Co and Do respectively. An arbitrary point Pp on D 
is chosen, and from it a tangent to C is drawn that meets D again at a point P;. The 
construction is repeated starting with P; to obtain P2, from P2 obtain P3, and so on, 
and in this way a sequence of points P,, is obtained on D. Poncelet’s theorem asserts 
one of two things happens: either the sequence of points is infinite, or for some n, 
P,;, = Po, and the sequence closes, and therefore there is an n-gon inscribed in D and 
circumscribing C. This is merely a logical dichotomy; but Poncelet also showed that 
if there is an interscribed n-gon for some choice of initial point Pp then there is an an 
interscribed n-gon for any choice of initial point Po. In other words, the behaviour 
of the sequence is determined by the two conics and their relative positions, but not 
by the initial point. 


°The theorem has a complicated history that cannot be discussed here, see Bos et al. (1987) and, in 
greater detail, A. del Centina’s article ‘Poncelet’s porism, a long story of renewed discoveries’ to 
appear in Archive for history of exact sciences. The underlying reason for the appearance of elliptic 
functions is explained in Griffiths and Harris (1978). 
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The German geometer Jakob Steiner had already investigated the special cases 
n = 5 andn = 6, and Jacobi followed his his lead and investigated the relationship 
between ¢, and ¢,_; where ¢, is the angle P,,Cy makes with Cp Do. He found that 


Gn+2 + On R-a 
t = t 
an( ; Fata Ont 


where R is the radius of D and a = |CoDo|. 
“Tn this form of the equations” said Jacobi (1828c, 285), “it springs to the eyes at 
once that they coincide with those for the multiplication of elliptic functions”.!° He 


QD 
set u = f where @ = am(u), and k is an explicit function of R and a, 
0 


dt 
and deduced that ¢, = am(u + nt). The necessary and sufficient condition for the 
n-gon to close is that am(u + nt) = am(u), in which case nt is a period, and this is 
independent of u. 
This proof was so unexpected and simple that later writers have referred to it 
as Jacobi’s geometric proof of the multiplication formula for elliptic functions—a 
complete reversal of his original intention. 


'0For example, those given earlier by Legendre, see Sect. 3.3 above. 


Chapter 9 
Gauss 


oe 


Carl Friedrich Gauss(1777-1855) 


9.1 Introduction 


After Gauss died in 1855 mathematicians who looked at his unpublished papers were 
astonished to discover how much he had known and never revealed. His sympathy 
for the work of Bolyai and Lobachevski, for example, was decisive in awakening 
the first positive readings of their work on the new, non-Euclidean, geometry. As 
surprising was the extent of his investigations into the theory of elliptic functions, 
an effect enhanced by the fact that he had made considerable progress with them 
even before Abel and Jacobi were born. He began by inverting the simplest elliptic 
integral, the so-called lemniscatic integral, thereby creating a new family of complex 
functions, and he rapidly became convinced that it would be necessary to build up a 
theory of complex functions in general, as we shall see. 
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9.2 Lemniscatic Functions 


Gauss worked on the subject of elliptic functions intermittently for 35 years, and 
because he scarcely published on the topic the historian is in the fortunate position 
of having an extensive array of working notes and drafts that document how he came 
to his discoveries. We also have a diary! that Gauss kept in his early years as a 
record of his discoveries. These documents were published after his death in various 
of the volumes that make up Gauss’s Werke, and have been frequently and expertly 
analysed over the years by Schlesinger (1912), Klein (1926-1927), Geppert (1927), 
and, more recently, Cox (1984). His progress is strikingly logical, and the questions 
that he could not readily answer became the focuses of particular interest for him 
and guided his research. 

From Gauss’s diary we learn that Gauss first considered the lemniscatic integral 


; dt 
— / Ja-" 


in September 1796. He treated the integrand as a function of ft, expanded it as a 
power Series and integrated it term by term, thus obtaining x as a power series in 
z. He then inverted this series to obtain z as a power series in x, thus inverting the 
lemniscatic integral. In January 1797 he read Euler’s posthumous paper (Euler 1786) 
on elliptic integrals and Stirling’s book Methodus differentialis of 1730, and learned 
the remarkable result that if 
1 1 
r / dx dB / x7dx 
= | ————- an = | ——— 
‘ V(1—x*) v(1— x4) 
then AB = §. He recorded in his diary that he now began “to examine thoroughly 
the lemniscate” (Diary entry nr. 51). He decided to call the function z of x defined 
above s/ for sinus lemniscaticus and introduced the corresponding cosine, cl(x) = 
sl(w/2— x). Euler’s addition formula for the lemniscatic integral could therefore be 


written as 
sl? + cl? +cl?sl* = 1. 


From a draft Gauss wrote on or before 1801 we see that he derived formulae for 
sl(2x), sl(x; + x2), and, for example, for s/(3x), which, setting s for s/(x), is: 


s(3 — 6s* — 58) 


sl(3x) = eae ee 7654 = 358 


‘Reprinted, with an English translation, a commentary and corrections in Dunnington (2004). 
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This shows that division of the lemniscatic sine by 3 leads to an equation of degree 9, 
whereas division of the ordinary sine leads only to acubic equation.” Gauss also wrote 
down the corresponding expression for s/(nx), and on March 19 1797 he noted in his 
diary (entry nr. 60): “Why dividing the lemniscate into parts leads to an equation 
of degree n?”. The (unstated) reason is that all but n of the roots are complex. This 
observation prompted Gauss to regard s/ and c/ as functions of a complex variable, 
as the draft we are following shows (see Gauss Werke 3, 407). The change of variable 
from x to ix in the lemniscatic integral establishes that s/(ix) = isl(x), and so Gauss 


wrote down formulae for s/(x; + ix2), from which it follows that cl(ix) = g ao 


Gauss defined ¥ to be the value of the complete lemniscatic integral i Ja 
and observed that sl (@) = 0, so from the addition formula it follows that as a real 
function, s/ is a periodic function with period 2a. Therefore the complex function 
sl had two distinct periods, 2@ and 2ia, and so, when m and n are integers, 


sl(x + (m+in)2@) = sl(x). 
This is the first occurrence of the Gaussian integers, m + in (see Werke 3, 411-412). 


Gauss now knew all the points where s/ and cl were zero or infinite, and so he 
proceeded to investigate s/(x) and cl(x) as quotients of two infinite series. He set 


and considered the functions M and WN as functions of a complex variable in their 
own right. Much as one might with the trigonometric functions on first acquaintance, 
he computed special values of them from the known values of s/(x) and cl(x) and 
formulae such as the addition formula. One result struck his as particularly intriguing. 
He calculated N(c) to 5 decimal places and its log to 4, and noted that this seemed 
to be 7/2. He connected this to Euler’s remarkable result that AB = 7/4. Because 
A = w/2, 2B must be 7/7, giving another connection between @ and 7. Gauss 
now wrote “Log hyp this number = 1.5708 = 5 of the circle” (Werke, X.1, 158). In 
his diary (March 29, 1797, nr. 63) he checked this coincidence to 6 decimal places, 
and commented that this “is most remarkable and a proof of this property promises 
the most serious increase in analysis”. 

He returned to these calculations for @ a year later, in July 1798, when he had 
a better grasp of M(x) and N(x), and he commented in his Diary (July, nr. 92): 
“we have found out the most elegant things exceeding all expectations and that by 
methods which open up to us a whole new field ahead”. Not for the first time the 
diary reflected some understandable optimism. It was to take Gauss a year to enter 
this new field, and the key was an old love of his, the arithmetico-geometric mean. 


2These and other formulae come from Gauss Werke 3, 403-412. 
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9.3 The Arithmetico-Geometric Mean 


The arithmetico-geometric mean is defined for two real numbers a and b as follows: 
set ag = a and bo = Db, and recursively 


1 
Ant = 5 (dn + bn) and by 41 = VGnbn). 


The two sequences {a,} and {b,} converge to the same value, called the arith- 
metico-geometric mean (or agm) of a and b, written M(a, b). It is clear that 


M(a, Ab) = AM(a, b), so M(a, b) = aM(1, b/a), 


so it is enough to study M(1, x) as a function of a single variable. 

On May 30 1799 Gauss wrote in his Diary (nr. 98): “We have found that the 
arithmetico-geometric mean between | and 2 is 77/7 to 11 places, which thing being 
proved a new field in analysis will certainly be opened up.” The proof, however, was 
elusive, as a letter from Pfaff to Gauss in November 1799 (Werke, X.1, 232) makes 
it clear. 

One of Gauss’s earliest results about the agm was a power series for M(1, 1+ x). 
He derived it in 1800, by setting x = 2¢ + t?, so 


M(,1+x)=M(,1+2t+f?) =M(14+t,14t+?7/2) 


12 
=c+om (11+ 75 —) (9.1) 


sO 


t2 
M(,14244+ 7) =0+9M 11,14 —— |. 
(,1+2t+f°)=(U+1) ( ‘aoa) 


Gauss then wrote M(1,1 +x) = 1+ ax + bx? +--- and used the method of 
undetermined coefficients to obtain two expressions for M in the above equation. By 
expanding both sides and collecting like terms he obtained the power series expansion 
of M(1, 1+ x), and found that it begins 


2 3 4 


MC 142) 142 x ae x 4, 
’ x)= 5a 
2 16 32 1024 


(9.2) 


This allowed Gauss to find other, related power series, such as the one for M(1 + 
x, 1 — x), and its reciprocal, which turned out to be most important in all the future 
developments.’ It has this strikingly regular expansion: 


3Schlesinger, in Gauss’s Werke, vol. X.2, 63, suggested that it was the fact that the reciprocal of 
M(1, V2) occurs which led Gauss to consider not the agm in general but its reciprocal. 
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K(x) (ee (9.3) 
x)= = nes . 
M( +x,1—x) 4° 64 " 256 


2 
the coefficient of x” is (35254) . Gauss showed formally that the function 


K (x) satisfies the linear ordinary differential equation 
(x? — x) K" (x) — 3x? — 1) K(x) +xK(x) =0 (9.4) 


and noted that another independent integral of this differential equation is M~! (1, x). 

Gauss connected this result with an earlier calculation of his. As soon as he had 
considered the functions s/ and c/ as functions of a complex variable, he had written 
down the formal Fourier series expansion of Tae 


1+sin2 
1 
————— = a+ bcos2V + ccos4V + dcos6V +--- (9.5) 
V1+ sin? V 
and observed that 
1 
; fy. PS 2 i dx 
a= 
22 -2?.42 am) J, — x4 
0 


—_ |) 
/1—y sin? V’ 
reciprocal of M(1, ./1 + y) is the V—free part (as he called it) of the expansion of 


—_—__.. This gave him that 
/1—y sin? V 


which is, of course w/a. Now he expanded and wrote down that the 


1 
dV 
Md, VJ/1+y) (| —— =1/2. 
i V1+4+y? sin? V 
Writing y = —k’, this becomes 
1 
dV 
M(1,/(0 —k?))- | —.————— = 7/2. 
J vi+k sin’ V 


So Gauss had shown that the complete elliptic integral, regarded as a function 
of the modulus k, was given by the reciprocal of the agm at the complementary 
modulus, k’ = V1 — k?. 

The significance of the clue given by the value of 7/a is now revealed: when 
k =i the above expression for M(1, x) reduces to 
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n/2 


dV 
ma.v%- | = 2. 
‘ V1+sin? V a 


With the known value of w/2 for the complete lemniscatic integral 


x/2 1 


| dV -/ dx 
‘ V¥1+sin? V fla 


the above equation becomes M(1, /2) = 1/a. This gave Gauss the conceptual 

explanation that he sought for the numerical coincidence that had intrigued him for 

so long, and it had been obtained by generalising the original question about complete 

lemniscatic integrals to the setting of complete elliptic integrals. The enigmatic value 

of N(q@) was simultaneously explained from the power series expansion of N, for 
TT 


the value of [ iL. 
0 a/ 1—k* cos* 


a complete elliptic integral of the first kind, is 7N(k). 


9.4 Elliptic Integrals 


By May 6, 1800 Gauss believed that he had led the theory of elliptic integrals “to the 
summit of universality” (Diary entry nr. 105). However, he soon noted that the theory 
was “greatly increased and unified”, becoming “most beautifully bound together 
and increased infinitely” (May 22, diary entry nr. 106). These entries, and two more 
written on or before June 3, show that Gauss had now proceeded to invert the general 
elliptic integral, subject only to the restriction that the real modulus is k. 

Gauss remarked in May 1800 (Diary entry nr. 108) that the new elliptic functions 
are doubly periodic and have zeros and poles and can be expanded as quotients of 
power series. His approach was two-fold: in one direction he inverted an elliptic 
integral depending on a parameter k; in the other he developed doubly periodic 
functions directly as quotients of entire functions. This raised the question of whether 
the two approaches define exactly the same objects. 

It was easy enough for Gauss to mimic the arguments for the lemniscatic integral 
in the general case, and conclude that the general elliptic function can be represented 
as a quotient. It proved much harder to show that every doubly periodic function 
defined as a quotient arises as the inverse of a suitable elliptic integral. The problem 
is to find the real modulus k from the periods, and Gauss answered this question 
affirmatively with a long argument involving the agm that will not be repeated here. 
We can, however, note, that the formula connecting the complete integral and the 
agm had already given him an expression for a period in terms of the modulus. Gauss 
now found an expression for the modulus as a function of the quotient of the periods 
(called the modular equation) and argued that this function could take any value. 
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Gauss now proceeded to push beyond the case where the modulus is real and 
to allow it to be complex. This restriction had arisen because of the way elliptic 
integral s had arisen in geometrical and mechanical problems, but once the modulus 
is complex the ratio of the periods can no longer be purely imaginary, and so the agm 
must itself become a complex-valued function of a complex variable. 

But this forces the investigator to decide at every stage which square root /anbn 
to choose. This raises a very difficult problem indeed, but one Gauss quickly solved. 
He wrote in his diary on 3 June 1800 (diary entry nr. 109) that connection between 
the infinitely many means has been completely cleared up, and two days (diary entry 
nr. 110) that “We have now immediately applied our theory to elliptic transcendents”, 
confirming the interpretation that the way Gauss took to generalising his theory of 
elliptic functions was to make the agm into a complex function. But we cannot be sure 
that Gauss had worked out the full solution at this date—the entry may be another of 
Gauss’s optimistic ones—and unfortunately, nothing survives from 1800 to indicate 
what Gauss’s solution was. The best indication we have is from a note from as late 
as 1825, although Schlesinger and Cox argue that the Diary entry nr. 109 of June 3 
1800 enables one to date the discovery to 1800. 


9.5 Gauss and Complex Functions 


I argued earlier that the work of Abel and Jacobi was purely formal in its handling of 
complex numbers and complex functions. The case of Gauss is rather different. It is 
clear from the way he presented complex numbers in his Disquisitiones Arithmeticae 
of 1801 that he was comfortable with their representation as points in the plane. This 
is already one step beyond the formal, algebraic approach. 

Gauss also left elliptic integrals behind when he came to consider elliptic functions 
with a complex modulus. He did not start with the integrals and invert them, but 
worked exclusively on generalising the power series side of the real theory to the 
complex case. This suggests that he was not willing to deal with complex integrals, 
but that he was willing to deal with complex power series, and indeed with linear 
differential equations in the complex domain, as other work of his confirms. 

Schlesinger suggested (1912, 184) that Gauss’s mostly unpublished theory looked 
like this by 1828: 

The first third was the general theory of functions arising from the (hypergeometric series), 

the second the theory of the agm and the modular function, and finally the third, which Abel 


published before Gauss, was the theory of elliptic functions in the strict sense. 


The hypergeometric series 


(9.6) 


F(a, 8,9.x) = 14 ($2) x4 (Soe ta, 


17 29+ D 
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(where the variable x is a complex variable of modulus less than | so that the series 
converges) was the subject of a paper Gauss published in 1812. There he showed that 
the series is intimately connected to the trigonometric functions, and that it arises in 
the Fourier series expansion of (a? + b* — 2ab cos @)~", so it is of practical use in 
astronomy. He also used it to give a quick proof of formulae that, he said, Euler had 
worked very hard to obtain, such as the formula that had intrigued him for so long, 
A-B=7/4. 

More importantly, Gauss had gone further in the second part of this paper, which 
for some reason he did not publish. He showed that the hypergeometric series satisfies 
the hypergeometric differential equation 


2 

(« 2) oF+0 ini puch ae ah (9.7) 
x dx 

From this he deduced that the hypergeometric equation defines a function of a com- 

plex variable x everywhere except at the points x = 0,x = 1, and x = ov. This 

function can be represented inside the unit circle by the hypergeometric series, but 

unlike the series it makes sense outside it. 

Gauss’s discussion of the following paradoxical result (his term) shows how good 
his insight into the nature of complex functions already was. He first set x = 4y— y, 
and y = a+ 3+ 1/2, when F becomes equal to F(2a,28,a+ 8+ 1/2, y), and 
then changed y to 1 — y and found that his earlier results implied that 


F (20.26.0464 5.9) =F (20,28,.0+8+5.1-9) (9.8) 


“which equation is certainly false’, as he said. He then resolved the paradox by 
distinguishing between the function F and the infinite series that represents the 
function only when the variable is less than one in absolute value. The series takes 
a unique value for each value of the variable for which it is defined and convergent, 
whereas the function does not. The false equation is meaningless as a result about 
series because they have distinct domains of convergence, and misleading as a result 
about functions because the functions are many-valued. One could no more deduce 
a false result here, he remarked, than one could validly infer from arcsin(1/2) = 30° 
and arcsin(1/2) = 150° that 30° = 150°. 

Even in this unpublished paper Gauss did not explain how the hypergeomet- 
ric series arose in the complex theory of elliptic functions. As we saw above (see 
Eq. (9.4)) he knew in 1800 that the functions y; = (denoted K, above) 


and y2 = 7 as satisfy the differential equation: 


1 
M(\+x,1—x) 


(8 - x) 24 (2-1) 2a sy=0 (9.9) 
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2 


This is not a hypergeometric equation, but on making the transformation x~ = z it 
becomes the equation 
d*y dy _y 
1 + (1-2 =0 9.10 
a(lL=—2) ie ( z) de (9.10) 


which is a hypergeometric equation with a = @ = 1/2,y = 1—indeed, it is 
Legendre’s equation. And, as Schlesinger pointed out, Gauss’s derivation of the 
hypergeometric equation for the hypergeometric series follows exactly his deriva- 
tion (Werke vol. X.1, pp. 177-180) of the power series for FES a ea| described 
above. Schlesinger therefore conjectured that it was Gauss’s recognition of the great 
generality of the hypergeometric equation that caused him to change his direction of 
research. Gauss’s theory of the modular function k = k(#) and the connection with 
differential equations (such as Legendre’s) mark a profound insight into the connec- 
tions between elliptic functions and a burgeoning theory of complex functions of any 
kind. 

Was Gauss therefore right to say in 1828 that Abel had come only one-third of the 
way? After all, as Gauss wrote to Bessel (Gauss 1880, 477), Abel “has followed the 
same path that I embarked upon in 1798, so that the great coincidence of the results 
is not to be wondered at ...”. This ignores Abel’s consideration of the extent to which 
the division equations are solvable algebraically, but Gauss would have noticed that 
Abel did not seem to appreciate the infinite series and product representations he had 
found, and discussed elliptic functions with a real modulus only. 

On the other hand, very soon Jacobi’s Fundamenta nova and Abel’s ‘Précis’ dis- 
played a general theory of the transformations of elliptic functions that surpassed 
anything Gauss had written down. Jacobi’s book, and Abel’s paper (1828a), did 
develop a theory of theta functions, albeit only for elliptic functions with real mod- 
ulus, or (in Abel’s case) a purely imaginary modulus. Even so, the theories of Abel 
and Jacobi remained entirely formal in their treatment of complex functions. 

Gauss’s theory of elliptic functions was not only of greater scope that Abel’s and 
Jacobi’s, it was deeper mathematically, because Gauss was clear about the importance 
of the complex domain. As he put it in a letter to Bessel on December 18, 1811 (Werke, 
X.1, 366), he asked of anyone who wished to introduce a new function into analysis 
to explain: 


if he would apply it only to real quantities, and imaginary values of the argument appear so to 
speak only as an offshoot, or if he agrees with my principle that the imaginaries must enjoy 
equal rights in the domain of quantities with the reals. Practical utility is not at issue here, but 
for me analysis is an independent science that through the neglect of any fictitious quantity 
loses exceptionally in beauty and roundness and in a moment all truths, that otherwise would 
be true in general, have necessarily to be encumbered with the most burdensome restrictions. 


In this regard, the most difficult question is to understand what is meant by an 
integral in the complex domain. In the same letter to Bessel he wrote that the value 
of a complex integral depends on the path between its end-points, and 
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b 
The integral [ ¢x.dx along two different paths will always have the same value if it is never 


a 
the case that ~x = oo in the space between the curves representing the paths. This is a 
beautiful theorem whose not-too-difficult proof I will give at a suitable opportunity .... In 


b 
any case this makes it immediately clear why a function arising from an integral [ }x.dx can 
a 
have many values for a single value of x, for one can go round a point where yx = ov either 
b 
not at all, or once, or several times. For example, if one defines logx by [ dx/x, starting 


a 
from x = 1, one comes to log x either without enclosing the point x = 0 or by going around 
it once or several times; each time the constant +27i or —27i enters; so the multiplicity of 
logarithms of any number are quite clear. 


Gauss was never to find the suitable opportunity, but this letter makes it clear that 
he had obtained the first crucial insight into the integration of functions of a complex 
variable. The theory is necessarily geometrical, and cannot be simply formal. 


9.6 Exercises 


1. Write down a short summary of the properties of Abel’s elliptic functions, explain- 
ing what is meant by double periodicity, and the zeros and infinities of these 
functions. 

2. Jacobi’s theorems about the representation of numbers fit into a context of similar 
theorems of this kind. What can you find out about Fermat’s conjecture that every 
number is the sum of n n-gonal numbers (three triangular numbers, four square 
numbers, and so on)? 

3. What was the importance of the hypergeometric equation by the 1820s? 


Chapter 10 
Cauchy and Complex Function Theory, 
1830-1857 


10.1 Introduction 


Cauchy’s publications in the 1830s were scattered over several journals, one of which 
he created for the purpose, while he lived first in Turin and then Prague, where he 
attended the Bourbon Court as a tutor to the Duke of Bordeaux, the grandson of the 
former Bourbon monarch Charles X. As Belhoste describes (1991, Chap. 10) this 
was an unrewarding experience dealing with a petulant boy that diminished even 
Cauchy’s energies for research. Cauchy concentrated on mathematical optics and 
the theory of light, but, as ever, he worked on many subjects, including the theory 
of functions defined by power series in a complex variable. When Dauphin turned 
18 in September 1838 Cauchy’s services were no longer required, and he returned 
to Paris with the title of Baron and a number of mathematical notebooks that would 
inspire him for several years. 

In this chapter we look at Cauchy’s erratic progress towards recognising the 
appropriate defining features of a complex function and his final appreciation of 
the “Cauchy—Riemann equations” and their implications. 


10.2 Complex Functions and Power Series Expansions 


By the time Cauchy took up his academic position in Turin he had already published 
a long paper in three parts (Cauchy 1830-1831) in the Italian journal Biblioteca Ital- 
iana, in which he had set out his approach to rigorising the calculus. This provocative 
paper generated a hostile reaction among Italian mathematicians still attracted to the 
old, Lagrangian, approach to the calculus, and Cauchy next attempted, rather disin- 
genuously, to deflect these attacks in a memoir he gave to the Turin Academy of 
Sciences. In it he claimed that his methods were necessary to help reduce the labours 
of astronomers. That is unlikely, but by setting out the principles underlying his 
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theory of the expansion of functions in power series his work now began to change 
the opinions of his Italian colleagues. 

Cauchy was aware of a fundamental connection between complex functions and 
power series expansions, but he struggled for some twenty years to formulate it 
precisely. In his Cours (1821, 151-152, 280-282) he had shown how to determine 
the radius of convergence of a given (real or imaginary) power series. His first attempt 
at the deeper result, that a complex function can be represented on some domain by 
a convergent power series, was presented in a Mémoire he presented and published 
in Turin (Cauchy 1831). There he said that: 


A function f(x) can be developed (by Stirling’s theorem) in a convergent series of ascending 
powers of x when, the modulus of x being less than or equal to p, the function f(x) remains 
finite and continuous for the modulus p or any modulus smaller than the real or imaginary 
variable x. 


This is inadequate.! The theorem is formulated only for a neighbourhood of 
the origin, and, more damagingly, the function was required only to be “finite and 
continuous”. This vague phrase, often modified in the course of a proof, was used 
by Cauchy to mean that the function was never infinite and was not multi-valued. 
But even this is not enough because no condition was imposed on the derivative— 
indeed it does not even require that the function have a derivative. Moreover, there 
is a considerable difference between real and complex differentiability, as perhaps 
Cauchy might have suspected: his famous example of a function that is zero when x 
is negative or zero and equal to e~!/ x” when x is positive shows that a real function 
may be infinitely differentiable and still not equal its power series expansion. But if 
Cauchy’s loose way with words can annoy us today, it is nonetheless the case that 
his attention to rigour, even here, greatly exceeded that of his contemporaries—as 
its mixed reception indicates.” 

It is helpful to look ahead briefly, to 1839 (when Cauchy was back in Paris), 
because Cauchy restated the Turin theorem on several occasions in the 1830s. In 
particular, he applied it to the solution of differential equations when he proved the 
existence of solutions to what we nowadays call the “Cauchy problem’ by expanding 
them as convergent series. Then in 1839 (Cauchy 1839, 486) he announced a new 
version of the theorem. Now he claimed that a complex function had a power series 
expansion whenever the modulus of the variable x remains less than the smallest 
value at which the function or its derivative cease to be finite and continuous. As 
Cauchy explained in a note he added to the paper, this hypothesis on the derivative 
f(x) was needed to make sure that one can change the order of the integrations used 
in the proof. 

He argued that when the variable x takes the imaginary value z = Ze? v—! one 
has the identity (which we would call the Cauchy—Riemann equations!) 


'See Peiffer (1978, 11-12). 


Cauchy found a more sympathetic audience in Piola, who translated Cauchy’s Turin Mémoire into 
Italian and published it in the Opuscoli that he edited (Cauchy 1834). 
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Of@ 1s OF) 
OZ  Z.J=1 Op 


Integrating twice with respect to Z and p he obtained 


/ f(2)dp = 2 f 0) 


and, under the further hypothesis that f(0) = 0, that ie "_ f(z)dp = 0. The theo- 


rem then pales on substituting 7 GE for f(z) (assuming x 4 z, |x| < Z), 
developing —- in a power series, and ‘finally by integrating the result term by term. 

Cauchy’s ae of functions, continuous together with their first derivatives, 
were Cos Z, sin z, and e*, which therefore could be expanded in power series for any z. 
Whereas, according to Cauchy, functions like ./1 + z, a log( + z), and arctan z 
cease, with their first order derivatives, to be continuous functions of z at the moment 
when the modulus becomes equal to |. Therefore their power series are divergent 
when the modulus is greater than 1. Finally, functions like e!/<, e!/ an and cos(1/z) 
were discontinuous at z = 0 together with their derivatives and consequently not 
capable of expansion in power series. 

So still in 1839 Cauchy was still insisting on continuity, rather than differen- 
tiability, as the crucial property for a function to admit a power series expansion; 
differentiability appeared as an artefact of the proof. In a second proof he gave of the 
theorem, he even claimed that his theorem (1840, 331) “immediately gives the rules 
for the convergence of series provided by the development of explicit functions and 
reduces the law of convergence to the law of continuity.” 

In 1833 the Bourbon Court had moved to its domains in the Czech lands, and 
Cauchy was called to Prague to act as tutor for the son of Charles X. There, in 1834, 
he was joined by his wife and family. His royal duties were to leave him little time 
for research or publication until 1839 when his tutee, the Duke of Bordeaux, turned 
18, and they came to an end. Cauchy, who turned 50 that year, was now free to move 
back to Paris permanently and to pick up the career he had walked away from in 
1830. 

He returned to an uncongenial place. Many of his older colleagues, among them 
Ampére, Fourier, and Legendre, had died; Poisson soon followed them in 1840. 
A new generation had grown up who had not been taught by Cauchy and did not 
know much, if anything, about the papers Cauchy had published while abroad. So 
he decided to republish the results that he had obtained in Turin and Prague, as well 
as his more recent results, by resuming his Exercises under the new title Exercises 
d’analyse et de physique mathématique. He also used the Comptes rendus des séances 
of the Paris Academy, recently established in 1836, to publish a note or a memoir 
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almost weekly, a number of which dealt with topics in complex analysis to which 
we now turn.? 

It is these papers in which Cauchy set out to rewrite his theory of functions of a 
complex variable and establish it securely for the first time. 


10.3 Laurent’s Theorem 


On 28 November 1841 Cauchy considered the uniqueness of a power series expansion 
that contains negative powers of the variable. He noted (Cauchy 1841, 359) that he 
had proved uniqueness when no negative powers are involved in his Cours d’analyse 
(Cauchy 1821, 144-145), and that that proof ceases to be valid when negative powers 
of the variable occur. Now he stated and proved that 


1. if the sum of a convergent series of positive whole powers of the variable is 0 
then all its terms are equal to 0; 

2. if the sum of a convergent series of positive and negative whole powers of the 
variable is 0 for a given modulus of the variable, then all its terms are equal to 0. 


From (1) and (2) the following uniqueness theorems could be easily derived: 


3. a continuous function can be expanded in a convergent series of positive, whole 
powers in a unique manner [one can only wonder what Cauchy meant by the word 
‘continuous’ in this setting, and what degree of confusion there might have been 
in his mind]; 

4. acontinuous function can be expanded a unique manner in a convergent series of 
positive, null and negative whole powers of the variable for a given value of its 
modulus and any value of its argument. 


Cauchy proved theorems (1) and (3) using the arguments he had used two decades 
before in his (1821). To prove Theorem (2) he multiplied both sides of the equation 
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ag +ayx+ anx? See a_\x7! + a_2x” 
by e’”?dp and then integrated with respect to p between 0 and 27, thus obtaining 
an for any positive, null or negative n. From this theorems (2) and (4) followed 
immediately. 

Other matters then occupied him, and he only returned to the topic in a paper 
(Cauchy 1843a) that he presented to the Academy on 31 July, 1843 when he gave an 
“elementary” proof of his uniqueness theorem (4) (one, that is, that did not resort to 
integration). 


3In this period, and for the rest of his life, Cauchy published something almost every week on 
whatever subject had caught his attention. An indication of their quality is provided by a letter 
Jacobi wrote to Dirichlet that is quoted in Schubring (2005, 430): “Cauchy has become unbearable. 
Every Monday, broadcasting the known facts he has learned over the week as a discovery. I believe 
there is no historical precedent for such a talent writing so much awful rubbish.” 
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This paper led Pierre Alphonse Laurent to submit to the Academy a “fragment” 
of a more extended work. Laurent was former pupil of the Ecole Polytechnique, who 
had graduated in 1832 as one of the best students of his class and eventually worked 
as a military engineer in Algeria before returning to France around 1840. He now 
became interested in mathematics, and submitted a paper to the Academy. Cauchy 
and Liouville were charged with reviewing it, and although they recommended 
that the paper be published the Academy did not accept the recommendation and 
Laurent’s original memoir was never published. Its principal results were included in 
the (posthumous) article (Laurent 1843). When the same misfortune befell a second 
paper Laurent turned to applied mathematics, until his untimely death in 1854.4 

However, at the next meeting of the Academy Cauchy (1843b) claimed that his 
papers (Cauchy 1837a, b) gave him priority over Laurent in the matter. Only on 
30 October did he present his (and Liouville’s) report on Laurent’s memoir to the 
Academy. Cauchy then went on to argue that the main theorem in Laurent’s paper 
could be deduced from a theorem he (Cauchy) had stated in Cauchy (1840, 186-187) 
and elsewhere. To support his claims Cauchy added to the report a note of his own 
(Cauchy 1843c) to show that “the easiest way” to obtain Laurent’s theorem was by 
reformulating the results in Cauchy’s (1840). If Laurent’s ideas can be disentangled 
from Cauchy’s presentation, it would seem that he had realised that a function f (x) 
that is ‘finite and continuous’ whenever |x| lies between two limits (i.e. in an annulus) 
can be expanded as a sum of two convergent series, one in ascending and one in 
descending powers of x. From this, Laurent had deduced results about the separation 
of the roots of an algebraic equation. Sadly, Cauchy’s less than generous response, 
coupled with his extraordinary rapidity of thought, was to prove typical of the way 
Cauchy would take the work of others as an opportunity to promote his own. 


10.4 Cauchy’s Path to the Cauchy—Riemann Equations 


In the crucial year 1846, albeit in a vague and incoherent manner, Cauchy was 
acquainted with the basic facts about complex functions. He had restated the integral 
theorem in a more general setting, he was adept at using the concept of a residue and 
the integral formula (see below). Yet a comprehensive theory of complex functions 
was still lacking, and an understanding of multivalued functions and their behaviour 
near branch points continued to elude him. More surprisingly, he was still far from 
clear what was the best way to explain what a complex variable might be. 

His use of paths in the (complex) plane should surely have rendered inadequate 
even in his own eyes the conception of complex quantities that he had given in 
his Cours (1821a)—but which he had apparently not yet abandoned—in which he 
had regarded imaginaries as symbolic expressions. However, in 1847 he proposed 
to make this theory more palatable by reducing the letter 7 to no more than a real 
quantity. In his opinion 


4See his obituary in Bertrand (1890). 
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This done, there is no need to torture the mind to discover what the symbolic sign ./—1 
could represent, for which German geometers substitute the letter 7. 


The symbol was merely a tool, he continued, that enables the mathematician to 
quickly reach the real solution of questions. But, he continued, the theory would 
become even clearer if everything, including the letter 7, was reduced to real quanti- 
ties. (Cauchy 1847a, 313). So, inspired by Gauss’s and Kummer’s work in algebraic 
number theory, Cauchy called two real polynomials equivalent if they give the same 
remainder when divided by i? + 1. Thus, to use much more modern terminology, 
calculations with complex numbers could be reduced to calculations (mod i7 +1) 
in the ring R[i] where i was ‘a real but indeterminate quantity’. The meaning of the 
term ‘real’ here is distinctly ambiguous. 

In an expanded version of this paper (1847), published as usual in his reborn 
Nouveaux exercises de Mathématiques, Cauchy extended the theory of equivalences 
to functions defined by means of convergent series. By considering, for example, 


the exponential series e* = >° a” and writing ix in place of x, he rapidly obtained 
Euler’s relation: 


nn} 
e'* = cosx +isinx mod(i* + 1). 


It took two more years until, “after new and arduous reflections”, as he put it, 
(Cauchy 1849, 152) he changed his mind again and came out in favour of the theory 
of geometric quantities, that is to say, the interpretation of complex numbers (and 
variables) in the complex plane adhered to by most of his contemporaries. In support 
of this view he cited several of its advocates, including Buée (1806) and Argand, but 
he mentioned neither Hamilton (1837) nor Gauss (1831). He said instead that he had 
been led to his ‘new’ conception by using the ideas that Saint Venant had presented 
in a note on geometric sums published in 1845. But now, in this geometrical setting, 
Cauchy began re-establishing the results he had been obtaining since the 1820s. 

Cauchy began to introduce the general concept of a function Z(z) = X + iY 
of a geometric quantity z = x + 7y rigorously in a way that he had failed to do in 
(Cauchy 1821a) when writing about imaginary functions of imaginary expressions. 
As he now put it (Cauchy 1853, 359-360): 


When one substitutes geometric quantities for imaginary expressions, the imaginary variables 
are nothing other than variable geometric quantities. It remains to know how functions of an 
imaginary variable can be defined. This last question has often embarrassed geometers, but 
every difficulty disappears when, guided by analogy one extends to functions of geometric 
quantities the definitions generally adopted for functions of algebraic quantities. In this way 
one initially comes to some singular conclusions which are nonetheless legitimate, as I 
shall indicate in a few words. Two real variables, or, in other terms, two algebraic variable 
quantities, are said to be functions, the one of the other, when they vary simultaneously in 
such a way that the value of the one determines the value of the other. [In this way] Z will 
be counted as a function of z when the value of z determines the value of Z. Now, it suffices 
for this that X and Y will be functions determined by x and y. 


He extended the concept of continuity to these functions. He began by recognizing 
that 
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in truth, the definition of a continuous function given in this work (Cauchy 1821) and gen- 
erally adopted today by geometers, is particularly applied to real or imaginary functions of 
real variables, but nothing prevents it being extended to the case where the variables are 
geometric quantities (Cauchy 1853, 367). 


Before doing so, in order to preserve single-valuedness in the case of multi-valued 
functions he made it clear that “it was easier” to consider each branch of the function 
as a separate, well-determined function of the variable. Then he repeated word for 
word the definitions he had given in his (1821, 34-35), merely translating them into 
geometrical terms (Cauchy 1853, 376). 

In spite of Cauchy’s claim, as Felice Casorati later remarked (1868, 71) a gen- 
eralization of this type was not suggested by a “truly profound analogy”. Analogy 
alone could only propose a theory of functions of two real variables; the crucial 
identification of functions of a single complex variable involved a new idea. 


It is thus that Cauchy himself, while conserving the given definition of a function of a 
complex variable x + iy, found it necessary to introduce an epithet to designate, among all 
the functions comprised in the definition, only those which, like all the functions resulting 
from ordinary systems of operations performed on the combination x +iy, enjoy the property 
of having for every value of x + iy a derivative independent of the value ay 


The epithet Cauchy introduced was ‘monogenic’, meaning that the real and imag- 
inary parts of a complex function satisfied the Cauchy—Riemann equations. One 
almost feels like giving a sigh of relief at this point. 


10.5 Final Stages 


All this time, multi-valued functions, such as the logarithm function and the square 
root function, continued to trouble Cauchy, and he tended to deal with them by cut- 
ting up the domain and treating each branch separately Freudenthal (197 1a, 141) has 
aptly remarked that “from 1821 he [Cauchy] treated multi-valued functions with a 
kill-or-cure remedy: if branched at the origin, they would be admitted in the upper 
half-plane only [although] according to Freudenthal, “fortunately he more often 
than not forgot this gross prescription, which if followed would lead him into great 
trouble,...”. Indeed, Cauchy’s flood of publications contains a large share of refor- 
mulations and repetitions, and, as Belhoste documented in his biography of Cauchy, 
his failure to present a coherent version of his theory of complex functions in one 
place caused difficulties for his contemporaries. Joseph Bertrand, in his review of 
Valson’s biography of Cauchy, which was filled with unstinting praise for both the 
man and his work, remarked that 


Cauchy, throughout his career, preserved both his rapidity of thought and his powers of 
invention and penetration. His genius always made him the master of the most difficult 
problems in a few moments. But every research involves trial and error and fruitless attempts, 
which Lagrange, Jacobi, and Gauss doubtless knew as well as him. What distinguished 
Cauchy, whose genius was without doubt equal to theirs, was that he informed the public of 
them minutely and at length. 
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Only as late as 1851 did Cauchy begin to realize fully that the crucial concept 
involved in his theorems on complex functions was not continuity but analyticity 
related to Cauchy—Riemann equations. The explicit definition of monogenic func- 
tions permitted Cauchy to specify the classes of functions to which his theory could 
be applied and to completely relieve his uncertainties about the correct formulation 
of his theorem on the expansion of functions in series. Referring to “the necessity 
of mentioning the derivative of a function of z” (Cauchy (1851a, 304)) he began 
to approach the correct hypothesis one has to require i.e. the existence of a unique 
derivative f’(z) (independent from the path). This condition is enough to ensure the 
analyticity of the function.> 

For monogenic and monodromic functions, Cauchy then reformulated his previ- 
ous results including his integral theorem and the calculus of residues. 


10.6 A Summary of Cauchy’s Ideas on Complex Analysis 


In his Mémoire of 1825 on complex functions, Cauchy had defined the integral of 
a complex function along a path in the complex plane, and showed that it was the 
same on two paths that did not enclose an infinity of the function. 

He obtained his integral theorem and his integral formulae for the case when a 
function is infinite at a point p inside a 


j= f@+yV=) = ef (x + yV—1) = 20/1 Res(f). 


rect 


Recall from §6.3 that when Cauchy had first considered the case when the function 
fat y./—1) becomes infinite for values x = a, y = b (in 1825) he had considered 
the limit, denoted f*, of f = lim(x —a+(y—b)J/—1) f(x + yV—1) as x tended to 
a and y tended to b or, as he put it, without noticeable error f* = ef (x + yV—1), € 
being an infinitely small number. The formula extends by simple addition to finitely 
many points in the rectangle where the function becomes infinite. 

In the case of higher order infinities (or, in modern terms, of a pole of order m) 
his integral theorem became 


f™Va+tby/=l 
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The integral formula is the result of applying the integral theorem to the integral 


f Oe @) ide where f is an analytic function inside and on the rectangle. The formula 


rect 
says 


5It was a considerable step forward when in 1900 Goursat proved the Cauchy integral theorem, 
where he stated that “by assuming Cauchy’s point of view it is sufficient to build the theory of 
analytic functions, to suppose the continuity of f(z) and the existence of the derivative’”—note, not 
that the derivative is continuous (Goursat 1900, 16). 
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It yields formulae for all the derivatives of the function f at the point ¢ and opens 
the way to a Taylor series expansion of an analytic function, so once Cauchy was 
clear what the definition of a complex analytic function was (and that it requires that 
the function have a continuous derivative) he could show (in 1851) that a complex 
analytic function has a Taylor series expansion. 


Chapter 11 
Complex Functions and Elliptic Integrals 


Joseph Liouville (1809-1882) 


11.1 Introduction 


This chapter considers how elliptic functions and complex functions were first 
brought together. This was an important step for both subjects, which, as Jacobi 
noted in his lectures, seemed to be kept apart by the complications resulting from 
the two-valued nature of the integrand in the elliptic integrals. 
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11.2 Liouville and His Student Hermite 


The prime mover was Joseph Liouville, who had already shown himself to be a 
versatile mathematician, and was the editor of a successful journal. In 1843 he was 
reading the first edition of Abel’s Oeuvres, and discussing the theory of elliptic 
functions with Charles Hermite, who was then a student of his. Liouville recorded 
in one of his working notebooks that Hermite had shown him that there cannot be 
a function with two distinct real periods. For, if there is such a function it has a 
Fourier expansion with respect to one of the periods. The existence of the other 
period determines properties of the coefficients that quickly show that the function 
must be a constant. Hermite would have found a similar argument in some papers 
by Jacobi that he was studying intensely at the time. 

In his notebook, Liouville argued by contradiction.'. He supposed that the function 
f (x) has two real periods a and 7. With respect to a the function can be represented 
this way: 


2i7 2imx 
A Ai Bi 
f@wm= ot Dd cos (=2* - a2 sin = ) 
Periodicity with respect to @ means that 
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and so 
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By equating the ith coefficients and adding the squares of the resulting equations, 
Liouville found that 


2i 2in B\? 
(4? + B?) (sw ne + (1 cos =) ) = 0. 
a a 


The ratio a/(@ must be irrational, or a and 3 would both be integer multiples of 
a smaller period and would not therefore be distinct. So, when i 4 O it must be the 
case that A; = B; = 0, and the function reduces to a constant. 

Liouville soon saw that this proof establish a much more interesting result: no 
doubly periodic function is everywhere finite. For, let such a function f (x + iy) 
with a period a be expanded as a series in x of the form 


'See Liitzen (1990, Chap. 13) 
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If @ is also a period, then 


2i 2i 2i 
Aj cos( = +2) =A; cos ( = +ej+ =") 
a a Qa 


So if a/@ is irrational and i 4 O it must be the case that A; = 0, and again 
the function is a constant.” This result is the special case of Liouville’s theorem for 
doubly periodic functions. 

Liouville found the generalisation of his principle from doubly periodic functions 
to all complex functions as early as 1844—but he did not publish it.* 


CHARLES HERMITE 


Charles Hermite (1822-1901) 


Hermite’s contribution was thought to have been lost for a long time, and was 
known about only through of a memoir he presented to the Academy of Sciences 
six years later, in 1849, where he presented a theory of doubly periodic functions in 
terms of Cauchy’s theory of functions of a complex variable. However, his original 
manuscript was found by Belhoste in 1988 and published some years later (Belhoste 
1996). 


?As Liitzen has pointed out out in his analysis of Liouville’s work, (1990), Liouville’s argument 
extends readily to any doubly periodic complex function. 


3See Liitzen (1990, Chap. 13). 
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In his memoir, Hermite attempted to show that every doubly periodic function 
is the quotient of two singly periodic functions (with the same period) which are 
defined everywhere, and so the function is of the form 


> Am exp (2m Tx) 


m 


> Bn exp (2m Tx) 


m 


He could make no progress with the idea that the quotient also be periodic with 
period b except in special cases. So he turned to study doubly periodic functions 
of the form F (x) = MQ)’ where the functions I (x) and ® (x) are periodic with 
period a. 

The values of such a function are known everywhere when they are known on a 
period parallelogram, and Hermite integrated the function around a period parallelo- 
gram to obtain expressions for the Fourier coefficients of F. He found the important 
result that the sum of the residues of a doubly periodic function in its period parallel- 
ogram vanishes. Then he showed that the doubly periodic function F can be written 
in the form 


2iT 
Ag+ — trie & — 21) + 2p & ~ 22) to + np — zn)} 


where the r; are the residues of F at the points z;, j = 1, 2,...,m and the function 
p(z) is periodic with period a. It is not a theta function, but is required to satisfy 
y(z + b) = y(z) — 1, a condition that makes the function F doubly periodic—that 
b is a period follows from the fact that the sum of the residues vanishes. 


11.3 Liouville on His Theorem for Doubly Periodic 
Functions 


In December 1844 Liouville informed the Académie des Sciences that he had found 
a general principle that seemed to him to give the theory “an uncommon character 
of unity and simplicity”: 


If a (single-valued) function is doubly periodic, and if one recognises that it never becomes 
infinite, one can, from this alone, affirm that it reduces to a constant. 


But he did not publish a proof of this remarkable fact, nor was he to. Perhaps fool- 
ishly, he confined himself to giving private lectures on it. Among his audience when 
they visited Paris in 1847 were the German mathematicians Carl Borchardt and 
Joachimstal, who on Liouville’s instruction gave copies of their notes to Jacobi and 
Dirichlet on their return home. Borchardt eventually published his notes of these 
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lectures in 1880, by which time he had become the editor of the Journal fiir die reine 
und angewandte Mathematik.* 

Cauchy recognised the generality of the principle at once, and at the very next 
meeting of the Académie announced this theorem: 


If a single-valued function reduces to a determinate constant F for every infinite value of z, 
then it will reduce to the same constant value when the variable z has any finite value. 


This needs a little interpreting. Cauchy meant that a complex function of a complex 
variable (a concept he had still not precisely defined) which is defined everywhere 
and is never infinite (it has no poles), and which it is bounded in a neighbourhood 
of infinity, reduces to a constant. Such behaviour is a strong and useful constraint 
on the global behaviour of complex functions that cannot apply in general to similar 
functions from R? to R?. 

Cauchy gave a proof, based on his calculus of residues, and seemed to suggest 
that he had possessed the means for publishing this theorem since at least 1831. His 
remarks amount to a claim that this result, today known as Liouville’s principle, 
should really be credited to Cauchy, since Liouville only applied it in the special 
context of elliptic function theory. In support of this claim, Cauchy went on to give 
five different proofs of Liouville’s principle in under a year. 

Thus provoked, Liouville decided to lecture on his work at the Collége de France 
in 1851. On this occasion he argued that a bounded complex function f is constant 
by arguing that if f takes its maximum value at a point zo say, then the Cauchy 
Integral Formula implies, on setting zo + re’? = z, that 


20 20 
rool =|>- | (core?) aol = f = |p (cotrel”)| a0 < 1s eo). 
0 0 


(11.1) 
which is impossible. 

In the audience were two young French mathematicians in Cauchy’s orbit: Charles 
Briot and Claude Bouquet. They decided that the time had come to write a book on 
the new theory of complex functions that Cauchy had finally got round to organising 
and publishing, and to show how the exciting and better-known theory of elliptic 
functions could be fitted into that framework. Their book appeared in 1859, with 
generous acknowledgements to both Cauchy and Liouville, and because it was the 
first textbook in the world to present the new theory, it did much to spread Cauchy’s 
and Liouville’s ideas. Liouville was not best pleased, however, and later wrote in his 
notebooks : “MM. Briot and Bouquet, cowardly thieves, but the most worthy Jesuits. 


28)). 


“Tn an otherwise faithful publication he chose to replace Liouville’s original proof with one of his 
own, based on the calculus of residues, as he said in an introductory footnote. 
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The controversy continued when Cauchy reported on Hermite’s paper on 31 March 
1851. He told his audience of the close connection that Hermite had established 
between elliptic functions and doubly periodic functions, and the result that the 
sum of the residues of a doubly periodic function inside its period parallelogram 
is zero. Liouville stood up and protested that he had had such a general theory of 
doubly periodic functions since 1844. As he explained, this theory was based on the 
general principle that a doubly periodic function must have at least two poles in its 
period parallelogram unless it is a constant. (As the lecture notes taken by Borchardt 
establish, this is equivalent to the principle Liouville had asserted in 1844, but it 
is not the same.) What Liouville had noticed was that if a given complex function 
f without poles is composed with the doubly periodic function sinam then the 
composite function f (sin am(z)) is constant, and so f itself must be constant. This 
reduces the general case to that of a doubly periodic function without poles, which 
was known to be constant. 

In conclusion, it would seem that priority in the discovery of the theorem resides 
with Liouville, who however, to his subsequent distress, did not publish it, so for once, 
the attribution of a name in the history of mathematics seems reasonably justified. 
But it was Cauchy who extracted Liouville’s principle from its original setting in the 
theory of elliptic functions, and gave it a much more natural proof within a theory of 
complex functions. His protestations of priority are also somewhat paradoxical. The 
more he showed how automatically it followed from his work twenty years before, 
the odder it looked that he had never noticed it before. One must suppose that a global 
principle of this kind lay outside his way of conceiving the subject, which was local 
and rooted in the study of power series. 


11.4 Doubly Periodic Functions and Complex Functions 


As we saw with the work of Gauss in Sect. 9.5 there would seem to be two theories: 
one concerns the functions obtained by inverting an elliptic integral with real or 
complex modulus, the other doubly periodic functions with two arbitrary periods 
whose ratio is not real. The obvious question is whether these theories are coextensive. 
But there were difficulties in the way of inverting elliptic integrals with a complex 
modulus, and equally there were problems in finding a suitable modulus given a pair 
of periods. 

Liouville’s theory of elliptic functions, as eventually published by Borchardt and 
Joachimstal as (Liouville 1880), investigates the problem of showing thatdoubly 
periodic functions are elliptic functions. In the first half of the memoir Liouville 
showed that, for a given pair of periods, every doubly periodic function with those 
periods is a rational function of a doubly periodic function with those periods and 
exactly two poles. This reduces the problem to that of realising a doubly periodic 
function with only two poles as an elliptic function. It would then be necessary to 
show thatthere are doubly periodic functions having any pair of periodsw; and w2 


11.4 Doubly Periodic Functions and Complex Functions 121 


(assuming only that the quotient of the periods is not real) and that there are elliptic 
function with the same periods, and to find the corresponding modulus. 

However, Liouville did not fully investigate these questions. He began by showing 
that if there is a doubly periodic function ¢ (z) with periods 2w; and 2w2, poles a 
and (3 , and zeros a and b (when, necessarily, a + b = a+ (9) then (z) satisfies the 
differential equation 


{¢' (Q}° = Ad* (2) + BE? (2) + CH (2) + DOD) HE (11.2) 


A, B,C, D, and E being constants. If moreover a = £3, then, he said, A = 0. 
To construct a doubly periodic function with arbitrary periods it would then be 
necessary to examine how the periods of ¢ depend on the constants A, B,...E. 
All that Liouville showed, however, was that if ¢(z) is an even function (that is, 
o(z) = (—z)) then the function ¢(z) is a constant multiple of either sin am(z) or 
its reciprocal. For in that case, as he showed, the differential equation reduced to 


= (¢' O) {1-# @} {1-Pe @] (11.3) 


where 
1 1 


i= G=a, a+b," 
a+) o(-3-) 


It is true that the connection between the periods 2w and 2w 2 and the constants 
A, B,..., E would be difficult to elucidate.> But Liouville settled for showing that 
his theory embraced the theory of Abel and Jacobi. Perhaps he took the view that 
the existence of doubly periodic functions with arbitrary periods was beyond doubt. 
But while he advanced the subject of doubly periodic functions, he did not unite it 
with the older theory of elliptic functions. 

In this context Jacobi’s comments of 1847, made after Borchardt had returned to 
Germany, are most interesting. He wrote (Jacobi 1847, 519): 


Liouville and Hermite, in works of which only a suggestion has been published, have derived 
all the results about addition, multiplication, transformation, and division from the purely 
periodic properties of the transcendents ...One can say that ...the new and true foundations 
for the theory of elliptic functions have been found. Apart from the simplicity with which 
the entire structure can now be presented, it accords with the advantage of greater strength 
and clarity than was possible up to now in those treatments that began with integrals. For 
the theory of imaginary values of algebraic functions and their integrals is still not built up 
enough for the consideration of such integrals, over the entire domain of real and imaginary 
values with both constant and variable magnitudes equally involved, to be taken as basic. 
This shortcoming is not intrinsic to the method which served me in the Fundamenta Nova, 
but goes back to a gap in the general method of analysis. 


>The mathematician who pioneered this route successfully was Weierstrass in the 1860s. 
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In the event, it was Briot and Bouquet who brought elliptic function theory and 
complex function theory together in a series of three papers (Briot and Bouquet 1856a, 
b, c) and a book (Briot and Bouquet 1859). The papers approached the subject through 
the theory of differential equations, and the book can count as the first exposition of 
Cauchy’s theory of analytic functions, although it is not without significant mistakes.° 


See Bottazzini and Gray (2013, 4.6). 


Chapter 12 
Revision 


A good way to begin to think historically is to try to imagine what things looked like 
at some date in the past without the benefit of our modern knowledge. After all, that 
is how everyone must proceed in their own present—no-one can know the future 
with certainty, we must all take risks. I have put a generic question appropriate to 
such an approach at this stage in the course at the end of this chapter, along with 
some advice about how to tackle it. 


Chapters 1-3 


The first three chapters are scene setting for the chapters that follows. To consolidate 
your study, it is not as important to follow the mathematics in detail as it is to follow 
the arguments in outline, identifying the key points and assessing their historical 
significance. 

When Lagrange criticised the poor state of the foundations of the calculus this 
was not the complaint of an outsider such as Bishop Berkeley but the considered 
opinion of the leading ‘pure’ mathematician of his day. He followed it with his own 
account of the foundations, and so gave the problem and his solution a heightened 
visibility in the mathematical community. There are two things to note about his 
solution: it is entirely algebraic, making no reference to infinitesimals, limits, or any 
other intuitive or geometrical argument; and it would suggest that every function is 
infinitely differentiable and can be given a power series expansion about any point 
in its domain. No-one objected to this conclusion, least of all Lagrange himself, so, 
knowing what we know today, we have to wonder why. 

Possible answers are overlapping and partly contradictory. Itis clear that Lagrange 
himself admitted that sometimes things can go wrong. The theorem is false for the 
square root function y = ./x at the origin, for example. But, Lagrange would say, 
of course it is. Theorems are accompanied by small print, their obvious exceptions. 
We deal in the same way today with statements about the natural or social world 
and allow that statements are true when they are only true most of the time (the 
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claim that ‘Anyone can run a mile in under twenty minutes’ is not refuted by the 
observation that some people cannot run at all). Lagrange’s theorem was taken to be 
true in general, obvious exceptions aside. There may be isolated points where the 
theorem is wrong, but quite generally the claim is correct. 

And indeed, what would you say a typical function was? What would Lagrange 
and his contemporaries have regarded as a good example to test out a theory on? 
What sort of object would it be: a graph drawn by ‘the free motion of the hand’? A 
finite algebraic expression? An infinite series, and if so, of powers of x or in terms of 
sines and cosines? A graph would presumably be capable of being expressed in some 
formalism, say that of infinite series, so the question becomes: what are they like? 
And, since the calculus is a local operation, what in particular are they like on very 
small domains? It is not unreasonable to believe that in fact all functions do have 
power series expansions, and no counter-examples to this general rule were known. 

The same conclusion arises when we look at Fourier’s work. There is more geom- 
etry here, but the same confidence that the calculus applies everywhere and all the 
time except where obviously—and trivially—it cannot. That attitude is a feeling 
that nothing in nature, and nothing in mathematics, obstructs the application of the 
calculus. It is then a matter of taste what one does with this feeling: for the ‘purer’ 
mathematician, Lagrange, it is better to have a proof; for the more ‘applied’ math- 
ematician, Fourier, is is simply a reliable basis for getting on and doing good solid 
work. 

As for Legendre’s work on elliptic integrals, it was cause for an outright scan- 
dal that mathematicians could not integrate f i ee but we shall not be 


pursuing the resolution of that matter as a piece of real analysis, because that is not 
how mathematicians took the story up. It is worth noting, however, what Legendre 
did do with his lengthy investigations over the course of a productive life. He found 
numerous applications for the new functions he introduced, of which the pendulum 
and the rotating solid were and are obviously important. And to do so he evaluated 
tables for his new functions, by an ingenious argument. It is striking how few familiar 
functions there were at the time (sin, cos, log, exp, and a few more) and to be any 
use they have to be useful numerically in concrete problems. This raises a valuable 
question about any question in mathematics: what counts as an answer? One very 
good way to know a function is to know a table of values. (A graph is another, some 
general property of it a third, its behaviour for large values of the variable a fourth, 
and so on.) 

In looking at this very important and highly regarded work by three major mathe- 
maticians we can also see a very interesting fact about mathematics as it is practised 
that will help us to understand the subject historically. A possible judgement of this 
work is that Lagrange was simply wrong, that Fourier was naive, and that Legen- 
dre missed the true significance of what he had achieved. Our problem with such a 
judgement is that it is historically crass, and it is worth thinking why. 

First, we must admit its kernel of truth (most dangerous falsehoods have a kernel 
of truth). Lagrange was wrong that the calculus can be given rigorous algebraic 
foundations along the lines he suggested; Fourier was wrong, and certainly naive, 
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in believing that every function equals its Fourier series; Legendre did miss the idea 
of inverting the elliptic integrals he studied and then treating them as functions of a 
complex variable. 

However, a moment’s thought reveals how useless such criticisms are. If a math- 
ematician of the calibre of Lagrange, and everyone else of his generation, failed to 
come up with, say, Cauchy’s ideas then this better suggests how difficult it was to 
carry through Cauchy’s programme. As we noted, Grabiner has carefully described 
how much of the apparatus Cauchy used, in the form of an algebra of inequalities, 
was used by Lagrange. This did not suggest to people that an arithmetic defence of 
the calculus in terms of limits could be made to work, despite d’Alembert’s (and 
even Newton’s) remarks in that direction, and that says a lot about the magnitude of 
the change that Cauchy initiated. 

Similarly, Fourier promoted a powerful idea without worrying about the finer 
details. That is not what mathematicians are taught to do in the 21st century— 
although one can suspect that not every truly original piece of research these days 
is entirely watertight. Fourier had every reason to believe his approach would work 
in many cases. What the later work investigating under what conditions, and to 
what extent, a function does obey Fourier’s command tells us is the extent to which 
Cauchy’s introduction of the purely continuous function into mathematics was to 
change every one’s understanding of what a function is—and that was to be a century- 
long story if indeed it is finished even today. 

And yes, as Legendre was the first to admit, Abel and Jacobi found marvellous 
new things to say about the elliptic functions that can be extracted from the elliptic 
integrals he spent much of his life studying. But Legendre had done a great deal to 
create new families of functions, give them both a theoretical basis and a practical 
one (his tables of their values), and demonstrate how they arise in significant areas 
of mechanics. There is no reason to expect more of anyone. 

Just as we should not treat a famous mathematician from the past as a genius 
incapable of error, we should not treat them as failures for not seeing what later 
mathematicians saw, or what we value today. They did not say everything, and not 
everything they said was right: that is how research gets done. 


Chapters 4-6 


The three chapters on Cauchy’s work in the 1810s and 1820s make an interesting 
contrast, one that was highlighted by his professional situation. His work on real 
analysis is very carefully structured, his work on complex analysis rather disorgan- 
ised. But even to frame the comparison in those terms is misleading. Cauchy knew 
very well that there was a subject called the calculus that occupied itself with func- 
tions, differentiation, and integration. Even by 1830 it was only intermittently clear 
that there was a distinct, if overlapping subject that could be called complex analysis, 
and which occupied itself with a distinct class of functions. So he could accept the 
task of teaching the calculus and the consequent requirement to present his material 
in an orderly and instructive fashion and allow his research on complex functions to 
take him where-ever it led. 
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That said, nothing should obscure the remarkable degree of organisation in 
Cauchy’s two lecture courses of 1821 and 1823 and the high degree of novelty 
they contain. The traditional presentation of this material put up a defence of the 
fundamental principles of the calculus and then got down to the proper business of 
developing methods for solving problems: evaluating integrals, solving differential 
equations, and generally thinking with the calculus. Cauchy introduced his novel 
concept of limit, made much more use of the ¢, 5, N methodology (if only verbally), 
defined not only continuity but convergence, differentiable, and integrable, and there- 
fore had much more work to do to substantiate and articulate these new, or newly 
reformulated, ideas. He was confronted at almost every stage with the need to re- 
cast old ideas and techniques in a way that would fit in with the theory that he was 
creating, before he could get round to solving traditional problems. 

In the other half of this work, Cauchy was for much of the time occupied precisely 
with methods. How can you evaluate double integrals? When and why does the order 
of integration matter? Why, rather mysteriously, can it help to evaluate a real integral 
by passing to complex variables? So it was quite natural for him to consider, as he 
did, the meaning of an integral along a path in the complex plane first when the path 
is a side of a rectangle with sides parallel to the coordinate axes. In that context, 
guided by the fact that the value of a repeated integral does not usually depend on the 
order of integration, it is reasonable to suspect that the integral of a function taken 
around the whole boundary should vanish. It is also natural to suppose that when the 
integral round the whole boundary does not vanish (which is precisely the case when 
the order of integration does matter) that the function being integrated should display 
some singular behaviour. Put all this together, and it becomes entirely reasonable for 
Cauchy to think he had made tremendous progress with the problem at hand, that of 
studying real integrals by possibly complex methods. It is only a later generation, 
guided in part by Cauchy’s own ideas of the 1840s and 1850s, that would see all this 
work of the 1820s as a set of deep discoveries and missed opportunities. 

What of the other unexpected feature of his work, its mistakes? To be sure, they 
sit oddly with a man who has a reputation for introducing rigorous real analysis, but 
they sit rather comfortably with a man whose collected works, originally written over 
a period of some 40 years, fill 31 large volumes. One might suppose that someone 
writing so fast would occasionally miss important points. But in fact the issue of 
mistakes by mathematicians is worth thinking about. Some, such as the famous 
‘mistake’ concerning the behaviour of a convergent sum of continuous functions, 
remind us just how deep and subtle the behaviour of functions can be. It is quite 
tricky to say exactly what Cauchy failed to see, even in modern terms. This alone 
should guard us against the foolish idea that the ‘great’ mathematicians see clearly 
all the time. Their mistakes can sometimes show that there was still much left to be 
discovered. 

But there is another way in which Cauchy makes mistakes: he is prone to slip into 
his arguments assumptions that should have entered the formulation of his theorems. 
One could say that the attentive reader could (should?) simply rewrite the statements 
of such theorems accordingly, but the effect is disconcerting nonetheless and makes 
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one wonder whether Cauchy was not himself confused. It surely made it harder for 
Cauchy himself to see clearly what was important. 

It is likely that this habit or failing of Cauchy’s was tied to an important view 
about the nature of mathematics, one that we no longer share (at least when we put 
on our philosophical hats), which is that mathematics is true because it is an idealised 
description of natural objects. The process of idealisation is unproblematic, it renders 
to the mind tidied up, simplified versions of real existing things that we can reason 
about accurately precisely because they have been simplified. The same process 
we saw at work with Lagrange, whose confidence in the differentiable nature of 
functions was sustained by his acquaintance with them, applies to Cauchy with his 
wider acquaintance. Cauchy in the 1810s and 1820s was occupied with the study of 
real functions, which could be illuminated on occasion by introducing imaginaries. 
There might then be a need to spell out carefully what the ‘passage from the real to 
the imaginary’ involves and say what becomes of a function of a real variable in this 
process. But there may not be any apparent need to have, invent, or find a theory of 
complex functions of a complex variable. 

On this view one can sustain exceptions to a theorem: they do not refute the 
theorem until there is a better point of view that gives them greater weight. This 
explains why a palpable falsehood bothered neither Cauchy nor Abel, but a contrived 
example produced by Cauchy was destructive for Lagrange’s foundations of the 
calculus. The example, y = e7!/ x is a strange object. Its definition must of course 
be augmented by the requirement that y = O at x = 0, which makes it not only 
continuous but infinitely differentiable at the origin. Such a definition was entirely 
novel in its day, when all functions were defined by formal expressions. Why was 
this peculiar object not dismissed as another ‘exception’, if not indeed banished from 
mathematics altogether as too contrived? 

The most obvious answer, which is not as cynical as it may sound, is indeed correct: 
it was not marginalised because its discoverer did not want it to be marginalised. 
Cauchy wanted this function not only to show a weakness in Lagrange’s theory but 
to show a virtue of his own. It became a forceful example, indeed a counter-example, 
because it is part of another way of thinking about functions that has many other 
features to commend it. One reason a contemporary could see readily that there was 
going to be no way this function could be dismissed as belonging to the realm of 
trivial exceptions was that it belonged very naturally to another substantial theory. 

All of which suggests that one helpful way to follow the history of mathematics 
might be in terms of what mathematicians thought they were talking about, and 
discovering not only more things to talk about but better things to say about them. 
Mathematicians who believe, rightly or wrongly, that they know what they are talking 
about can look at new results and assess their worth as a naturalist does, in terms of 
their truth, how typical or important they are, and start to assess their implications 
long before they look at any proof. 


Chapters 7-11 


It is clear that the work of Abel and Jacobi has its origins in the work of Legendre. 
Legendre’s theory of elliptic integrals had established itself as a new branch in the 
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theory of functions of a real variable, one that enlarged the domain of what could 
be explicitly integrated and added to the list of problems in mechanics that could be 
solved. The contribution of Abel and Jacobi was to turn elliptic integrals into elliptic 
functions and to make the new elliptic functions a rich and interesting collection of 
complex-valued functions of a complex variable. This was a different, more concep- 
tual, and, if only because complicated, a more abstract kind of knowledge. Why was 
this regarded as exceptionally good mathematics, worthy of prizes and capable of 
making people’s reputations? Why was the sensible reaction not to say things like 
this: elliptic functions are useless, it is elliptic integrals that arise in actual problems; 
complex-valued functions of a complex variable are an abstraction made solely for 
abstraction’s sake and we need not (and therefore should not) go there? 

Such questions should always be asked of any new piece of mathematics. It is 
only healthy to ask, in an open-minded way, what is good and important about 
any new piece of work. Answers may be hard to come by: it is hard to predict the 
future and easy to make cheap claims for or against anything while waiting for real 
evidence to come in. The historian, of course, is more interested in reasons given at 
the time, not those with hindsight or those that ‘ought’ to have been given. In the 
present case, we know that Legendre was delighted to see his elliptic integrals given 
a new lease of life, and that both Abel and Jacobi had shown how well the new ideas 
solved problems raised by Legendre’s work and extended it. The new functions also 
had much more comprehensible properties than the corresponding integrals, and not 
only aricher collection too, but one that was made accessible by the analogy with the 
familiar trigonometric functions. It helped, of course, that existing and well-regarded 
problems elsewhere in mathematics were illuminated by the new ideas. 

It is harder to say what feelings were about the truly complex (as opposed to 
real) nature of the new functions. Algebra and analysis in the 18th Century had 
been no more involved with the complex numbers than was necessary, or perhaps 
opportune. At the start of the 19th Century there had been a flurry of interest in the 
philosophical ‘meaning’ of complex numbers. There was a renewed interest in proofs 
of the fundamental theorem of algebra, and Gauss, in his Disquisitiones Arithmeticae, 
had shown that there were good reasons for studying what are called cyclotomic 
integers. These are expressions involving ordinary integers and roots of unity in a 
way that kept many of the properties of the familiar integers—and of course one could 
ask: what was good about taking mathematics in that direction? It’s an open question 
as to whether a growing interest in complex variable and complex functions came 
before the work of Abel and Jacobi, was crystalised by it, or was created by it. In this 
connection the different attitudes of Gauss and Cauchy are worth thinking about, 
and then indeed one would want to know how these exceptional mathematicians 
influenced their contemporaries. Were they listened to, were they regarded as too 
difficult? 

If we look past 1835 we can begin to see what people did in fact do in response 
to all this mixture of work. We see new developments in the theory of complex 
functions, complicated by Cauchy’s wanderings round Europe, and better and better 
attempts to use the new theory to make sense of the great examples of complex 
functions, which were the elliptic functions. The obstacle that they all perceived was 
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the two-valued nature of the integrand in an elliptic integral, which seemed to take it 
out of Cauchy’s domain of ideas entirely; it is worth thinking about why this was so. 
Do you think this will turn out to be a permanent obstacle, or will it be overcome? 

The contrast between the bustling, energetic, and oddly solipsistic Cauchy and 
the quieter, more socially responsible Gauss is fascinating at every level. Purely 
metaphorically, I imagine Cauchy as the dinner guest who talks all the time, is 
frequently brilliant, and moves rapidly from one topic to another and back always with 
something to say. Gauss I imagine as a person of many fewer words, but, on balance, 
with the more difficult and deeper things to say. Gauss, more than Cauchy, saw that 
complex numbers and complex functions were an important new development, even 
if one wants to say that Cauchy did more for the subject in the end. 

But the fundamental question remains: what would you, as an aspiring mathe- 
matician in 1830 or 1835 have done with all this new work? Carried on as you were? 
Learned the new stuff at once? Perhaps it depends on how confident you were of 
your own merits, your own potential contributions. 


A Question 


Imagine it is 1835 and you are young British professor finishing a year or two studying 
mathematics abroad. You are writing a letter to a very good student who will take 
your place out here, advising them on what they should study when they arrive. 
Describe recent developments in EITHER the theory of functions of a real variable 
OR in the topic of complex functions and complex integrals. 


Advice 


Think hard about what constitutes good advice, as opposed to information. 

When you are offering an opinion or judgement (as in “How important was...” 
or ‘why was ....’) give a brief argument in support of your opinion. 

Distinguish between contemporary criticisms of someone’s work and your own 
judgements. Don’t be afraid of offering your own judgements, but do not say anything 
from the perspective of the present day that could not have been said in 1835. 

You do not have very many words, so anything you say about personalities must 
be essential. 


Chapter 13 
Gauss, Green, and Potential Theory 


13.1 Introduction 


The basic idea of a potential function is very simple. It applies to problems in dynam- 
ics where one is given a force, and the idea is to find a function f(x, y, z) whose 
partial derivatives give the components of a force. The function, if it exists, is called 
the potential function for the given force. It was introduced, without a name, by 
Lagrange in his study of gravitational attraction. The idea became even more impor- 
tant when Coulomb showed that electrostatic forces also obeyed an inverse-square 
law and then Ampére showed that there is also an inverse-square law in electrody- 
namics. It is in this fast-moving context that gauss and Green did their important 
theoretic work on potential functions in their 1830s and 1840s that is the subject of 
this chapter. 


13.2 Potential Theory 


First, some remarks about notation (for a revision of the theory, see Appendix C). 
In the 1830s and 1840s mathematicians wrote out the corresponding expressions in 
full. Today we have a good notation using symbols like V f and A, or terms like div, 
grad, and curl. These are not only compact, they allow the use of vector operations 
(the dot and cross product) and the use of vector identities. I have decided to use V f 
and A, but purely as abbreviations. 

We shall write the three partial derivatives of a function as (34 : ge 55) , denote 


it by V f, and refer to it as the gradient of f. We also write 


_ OV j av Vv 
~ Ox2 © Ay2 © Ay" 


AV: 
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Given a function f and a surface with an outward pointing normal n at each point, 
we write 
of 


aS and V f.n 


for the normal derivative of the function f/f. 

Lagrange introduced the idea of a potential function in 1773. His question was: 
how to determine the potential function for a given body, and thus its gravitational 
attraction? After Legendre had opened up the subject, Laplace showed how to find 
the potential function, V, for the gravitational force due to an arbitrary spheroid in 
1782 by assuming that it satisfied the differential equation 


20 ge oe 
~ 0x2 Oy? — Ay? — 


AV: 


which is why we speak of Laplace’s equation today. In the 1780s Charles Coulomb, 
a French military engineer, proposed the inverse-square law in electrostatics, which 
differs from the law of gravitation only because the force is one of repulsion and so 
a minus sign enters the formulae. In his efforts to produce a mathematical theory 
of electrostatics in 1813 Poisson was led to generalise Laplace’s equation to AV = 
—4rp, which applies also to points inside a solid (the function p is the density of the 
solid). 

These mathematicians had noticed that a force F(p) exerted on a unit mass at a 
point p by a mass distribution p(x) is given by the integral 


_ f p(x) = p) 
ro)= | oe 


This is a complicated vector expression, but Lagrange’s potential function is the 
much simpler scalar function 


p(x) 
V = dx. 
(P) = 


Direct computation shows that —VV(p) = F, and for the forces of gravity and 
electro-statics, as Laplace showed, AV = 0 wherever p = 0 and AV = —4rp 
wherever p 4 0. 

Among the reasons for wanting to study potential functions were the motions of the 
moon and more generally of the newly discovered asteroids. The moon was Gauss’s 
interest in 1813 when he studied homogeneous ellipsoids. He had other interests that 
we cannot consider in this course for reasons of space, notably the two-dimensional 
case when the corresponding potential function is called the logarithmic potential, 
and he was interested in the problem of the conformal mapping of one region on 
another in connection with geodesy and terrestrial map making. 


13.2 Potential Theory 133 


But the topic became even more interesting when it was shown that it applied to 
the new and much more mysterious processes of magnetism and electro-dynamics.! 
The leading figure here was the French mathematician André-Marie Ampére, who 
responded within days to Oersted’s experiments on electro-magnetism with a rush 
of activity in 1820. In experiments involving the magnetism produced by an electric 
current, the Earth’s magnetic field is a complicating factor. Ampére derived an exper- 
imental set up that eliminated this effect, and Oersted’s complicated findings were 
immediately simplified. By 1826 Ampére had written his definitive masterpiece, his 
Mémoire sur la théorie mathématique des phénoménes électrodynamiques, unique- 
ment déduite de l’expérience (or, Memoir on the mathematical theory of electro- 
dynamics solely deduced from experiment). 

Ampére offered a unified theory of electricity and magnetism in the form of math- 
ematical laws, and seldom ventured into the realm of physical explanations about 
the nature of electric current. Magnetism he explained in terms of electric currents 
in the magnetised body, an idea he extended to the magnetism of the Earth. His 
abstinence deliberately recalled the way Fourier had not offered a physical account 
of heat and heat diffusion. His mathematical theory fitted in well with the contem- 
porary and brilliant experimental work of Faraday, and for a time seemed to offer a 
better explanation of the facts than Faraday’s, and so they were widely taken up. 

In 1828 Gauss visited the newly built magnetic observatory that Alexander von 
Humboldt had constructed for Berlin University on French principles. Gauss was 
not impressed; he was used to much higher standards of accuracy in his work on 
astronomy and land surveying. He went back to Gottingen and three years later was 
able to team up with the young physicist Wilhelm Weber, who was appointed to the 
chair of physics at Gottingen in 1831. They were attracted by the opportunities for 
new mathematics, for a new or at least deepening science of electromagnetism, and 
for applications to navigation and geodesy. They were successful in all three respects, 
although we can only follow the first of them here. In this course it is more interesting 
to pursue the development of the general theory, and, Gauss aside, one name stands 
out in the 1830s and 1840s, that of the Englishman George Green. Green published 
first, but his work was not picked up until after Gauss’s had been read, so in terms 
of their reception it is better to take Gauss first. 

Before we do, note also one key difference between potential theory applied to 
the study of mass and applied to the study of charge. The more important departure 
from gravitation to electrostatics follows from the fact the charge in a body is located 
entirely on its surface, so it is natural to study the surface distribution of charge. 
This gives rise to a potential V(p) = [ oe dS, where the integral is taken only 
over the surface. On the surface it is assumed not only to be continuous but to have 
tangential derivatives, and two one-sided normal derivatives: the (outward) normal 


derivative is pe = v,.Vu (the limit is taken along the normal going outwards) and 
the (inward) normal derivative is fe = v_.Vu (the limit is taken along the normal 


'For the next few paragraphs this treatment follows Darrigol’s Electrodynamics from Ampere to 
Einstein, (2000). 
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going inwards). These differ because there is mass in the surface, and their difference 


is: 
Ou Ou =a 
av. Oy, 


where p is the mass distribution on the surface. 

Because of the importance of the subjects to which potential theory could be 
applied, there developed a prodigious industry of finding classes of functions spe- 
cially adapted to particular cases of these equations. For example, if the Earth is the 
solid of interest, it might be considered as a sphere, a spheroid (rather less likely as an 
ellipsoid), or as almost spheroidal, each change bringing greater accuracy and rele- 
vance at the cost of greater complexity. For example, to this day the vibrations of the 
earth caused by earthquakes are studied using functions called spherical harmonics. 
We do not have the time to go in this direction, which requires a certain virtuosity, 
but it should not be forgotten. 


13.3 Gauss 


Gauss’s most important work on the subject is his Allgemeine Lehrsdtze (General 
Propositions) of 1840. In it he first gave rigorous proofs of results already published 
by the French—although Gauss for some reason did not mention them by name. 
Then he set out some new results of his own. The first is a reciprocity theorem: If 
and y’ are two mass distributions on regions vol and vol’ respectively, and V and 
V’ are their associated potential functions, then [,,,, Vu = J,,,7 Vi. His proof for 
this is very straight forward. He started with the simplest system, one consisting of 
a single mass M at the point P and with a potential function V, the other of a single 
mass M’ at the point P’ with a potential function V’. In this case 


/ 


M 
MV'(P) = M——_ = MVP). 
|P — P’| 


So a similar expression holds for two mass distributions each consisting of a finite 
number of points, in the form of a sum 


Li Mi Vi (Pi) = Xj Mj Vji(Pj), 


where the sums are taken over points in each system (to get started, suppose one 
consists of one point and one of two, and write out the corresponding expressions.) 

Finally, for continuous mass distribution on curves, or in surfaces or in solids, the 
sum is replaced by an integral. Gauss also paid attention the case when the two mass 
distributions lie on the same surface, which he dealt with as the limiting case of mass 
distributions on two surfaces that are brought together. 
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It is sometimes easiest to regard the reciprocity theorem as the equality [ Vu = 
| Vu’ where the two integrals are taken over the whole of space, and a non-zero 
contribution to an integral is made only where there is mass. 

From this Gauss derived an expression for the potential V (0) at the centre of a 
sphere S of radius R due to a mass distribution jz outside the sphere and having a 
potential V. This is his Theorem of the arithmetic mean 


1 
V. 
47 R2 [ 


Gauss gave this proof. Let S denote the sphere of radius R, B its interior, and 
X its exterior, X = R* \ (BUS). We shall compare three mass distributions. The 
first is given by a function y in the region X that lies outside the sphere S; this 
mass distribution has a potential V. The second mass distribution, j1;, is a uniform 
distribution on the sphere of mass 47 R*. The potential function associated with this, 
Vi , 1S aconstant, 47 R, inside the sphere, and at a distance r > R outside the sphere 
the potential is 42 R*/r. Notice that these two functions for the potential associated 
to the second mass distribution agree on the sphere. It is a not entirely trivial fact 
that in regions outside the sphere the potential is the same as the potential of a mass 
of 47 R? at the centre of the sphere, and this leads us to our third mass distribution, 
2, which is indeed a mass of 47 R2 at the centre of the sphere, which is the origin, 
O, with the corresponding potential function V2(r) = 47 R?/r. 

Now, the reciprocity theorem applied to the first and second of these says 


[.vu=f Vip. 
R3 R3 


We write each integral as a sum of three: fp; = [_ + f st Jy. and we then find that 
some of the contributions are zero: 


[vis [vm f vino [ vai+o. 
B AY x S 


because the mass distribution ju; is concentrated on the sphere S. So the left hand 
side reduces to [, sVi = ff s V, because the mass distribution is uniform. The right 
hand side, when similarly written as a sum, becomes 


[vm [vm f vin=040+ | Vin 
B S x xX 


because the mass distribution js is concentrated in the region X. Furthermore, 


V(0) = 


bE xy Vin= be x 4nR? jt. So the first use of the reciprocity theorem says that 


; 
i ee 
V= Le. 
S x r 
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To deal with the second of these integrals, we apply the reciprocity theorem to 
the first and third integrals. It now says 


[ves | Vo}. 
R3 R3 


We write each integral as a sum of three: fg; = J, + J. st Jy, and we again find 
that some of the contributions are zero: 


[viet [viet | vie= f 1a+o-+0, 
B Ss x B 


because the mass distribution j2 is concentrated at the centre of B. So the left hand 
side reduces to [, o V2 = 4 R7V (0). The right hand side, when similarly written 
as a sum, becomes 


[vn | vans f vou = 040+ | van 
B S Xx x 


because the mass distribution jz is concentrated in the region X. So this use of the 
reciprocity theorem says that 


Ar R2 


At R7V (0) = LL. 
Xx 


: 2 
But the first use said that [> V = fy tk Ll. So we deduce that [; V = eV (0), 
which establishes Gauss’s Theorem of the arithmetic mean. 
Gauss then showed that if the given mass distribution also has a total mass M® 
inside the sphere then 


| V = 47R°V(0) + 40 RM°. 


From these results we can deduce (although Gauss did not) that this implies that 
the potential cannot have a maximum or a minimum outside the mass distribution. 

Gauss then proved what is today called Gauss’s law or Gauss’s theorem: 

If V is a potential due to a mass distribution and M the total mass inside a closed 


surface S then 
OV 


= a 
5 OS - 


The theorem has this corollary: If there is no mass inside then 


/ VV.n= 0. 
surf 


Gauss then proved that 
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/ wv =— | V(VV.n). 
vol surf 


It will be convenient to follow Green’s proof a little later on, which is not that different 
from Gauss’s anyway. For a sketch proof and a critical comment, see the remarks at 
the end of this chapter. 

From this Gauss deduced that if a potential due to a charge distribution on a surface 
is constant then it is zero everywhere if and only if the total charge is zero, and the 
potential is smaller than this constant value away from the surface if and only if the 
total charge is different from zero. In particular, the potential function itself cannot 
be zero everywhere in a region without vanishing altogether, as we shall shortly see. 


13.4 Green 


The English mathematician George Green was born on 14 July 1793, in the town of 
Nottingham. He inherited a successful milling and bakery business from his father, 
and on the basis of his own reading in libraries and with a little help from local 
mathematicians who had passed through Cambridge he set to work on his own in 
mathematics. He read what he could of the relevant literature, which was not much: 
Laplace’s Mécanique Celeste, some papers by Poisson, Biot, and Coulomb (more, 
by the way, than he could have learned if he had gone to Cambridge). Then, in 1828 
he wrote the remarkable work entitled An Essay on the Application of Mathematical 
Analysis to the Theories of Electricity and Magnetism, which we shall consider below 
and which brought him to the attention of Sir Edward Ffrench Bromhead, a local 
dignitary and Cambridge graduate. In due course Green himself went to Cambridge, 
where he was a student and then, for a brief time, a Fellow. At Cambridge he wrote 
his few other papers, which deal with magnetism, electricity, and hydrodynamics, but 
finding little to stimulate him there he returned to Nottingham, where he died a few 
weeks short of his forty-eighth birthday, not even, it would seem, a local celebrity. 

Green had become interested in the hugely popular topic of the emerging new 
physics of electricity and magnetism. People were excited by dramatic public displays 
of experiments, and less flamboyant scientists discovered new phenomena for which 
there was no satisfactory theory. Green set himself the task of facilitating, as he put it, 
“the application of analysis to one of the most interesting of the physical sciences”. 
The new physics was to call for new mathematics, and was based on intuitions of a 
non-Newtonian kind. This was to prove the secret of their lasting impact. The very 
word ‘analysis’ in the title reminds us that Green was self-taught, for it refers to 
the calculus as it had become in France, not to the sterile exercises in Newtonian 
methods preferred in England. 


?We know remarkably little about Green, and much of what we do know was discovered by the inde- 
fatigable Mary Cannell and can be found in her fine book on Green: George Green, Mathematician 
and Physicist, 1793-1841 (1993). 
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To Green is due the convenient term “potential function’ that describes the effect 
upon a point of forces coming from a system of bodies. The introduction of the 
name helped move the concept to centre stage. Green had learned from Laplace and 
Poisson that the potential function satisfies two partial differential equations: one 
for points outside the body, and another for points inside. He then formulated what 
today is called Green’s theorem, which can be thought of as a reciprocity theorem, 
and which generalises the fundamental theorem of the calculus to several variables. 

Green’s reciprocity theorem concerns a closed surface S enclosing a volume V, 
and two functions, U and U’, of the 3 variables x, y, z. We may write dv for the 
volume element in V and ds for the area element of S, although I shall usually 
suppress that part of the notation. We also write fn for the normal derivative of a 


function f—Green wrote 6 where we write A, w for what we write as n, and (<4) 


for what we write as fn. 
Green’s reciprocity theorem says that 


[vavia- | u'audv= f[ v'Uy— | UUy, 
Vv Vv S S 


One way to understand this is to take U’ to be the constant function 1, when the 


theorem reduces to 
| AUdv = -{ Un, 
V Ss 


when it now says Gauss’s law. 

It seems that Green initially found his reciprocity theorem difficult to prove, but 
he eventually came to this elegant proof. He considered the integral i a VU.VYV, 
where U and V are arbitrary functions, so V7U ¥ 0. Green, more or less as Gauss 
was to do, integrated this expression by parts, taking each of the three parts of the 
sum separately, which gave him 


[ vor | vin.vu) — | VAU. 
vol surf vol 


This reduces to Gauss’s law when U is a potential function and there is no mass in 
the region being integrated over, but in the general case that Green was working with 
the final term does not vanish. However, Green noted that U and V can be exchanged 
and the integral does not change, so it is also the case that 


[ vv = | uve - | UAV. 
vol surf vol 
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On subtracting this from the first equation, he obtained the desired result: 
Ml VAU -{ UAV= / [U(n.VV) — V(n.VU)] (13.1) 
vol vol surf 


It remained for Green to show how to solve Laplace’s and Poisson’s equation 
and so determine the potential function for a wide variety of distributions of mass 
or charge. He had the remarkable idea of solving these equations in extreme cases, 
where the solution is easy to discover, notably the case where the potential was 
caused by a single charge at an isolated point. In this case the potential function 
therefore satisfies Laplace’s equation in any region that does not contain the point. 
Green assumed that the potential increased like 1/r as one neared the point charge, 
and showed how to solve the equation for such a curious function by an attractive use 
of the consequences of Green’s theorem that he had developed in the earlier part of 
his Essay. These functions are called Green’s functions today, and they are a standard 
part of the repertoire in potential theory. 

To pass from the solution to the problem with these extreme boundary conditions 
to more plausible ones where the electricity is distributed through an entire body, 
Green argued first that a problem with finitely many point charges is solved simply 
by adding the solutions, and second that infinitely many point charges distributed in 
some way are handled by replacing addition with integration, on the assumption that 
the potential function vanishes on the boundary of the body. 

Green’s reciprocity theorem is a piece of rigorous mathematics, whether pure 
or applied, but his analysis of potential functions in terms of ‘Green’s functions’ 
was not, as he was well aware, although he did give a plausible limiting argument 
to show how such potential functions can be defined. Amusingly, naive arguments 
involving Green’s functions have tended to strike mathematicians as likely to make 
sense on physical grounds, and physicists (such as Maxwell) as likely to be amenable 
to rigorous mathematics. 

However, there are many other situations where a point source of influence is 
a natural object to study. To mathematicians among Green’s eventual readers the 
example of a complex function was irresistible. Cauchy had shown during the 1820s 
that much of their theory follows from the distribution of their poles, and later writers, 
such as Riemann, saw Green’s theorem as the natural way to prove the Cauchy integral 
theorem. To a physicist a single force concentrated at a point might represent an 
impulse. A load on a beam might be concentrated at a point. These extreme situations 
have the advantage over more general ones of simplifying the attendant mathematics, 
which is why Green’s functions have become such a powerful idea. 

Green’s Essay was not appreciated in his lifetime, although his later papers earned 
him a modest reputation. That the Essay became known at all was the work of William 
Thomson, who had picked up a stray reference to it and finally tracked the Essay down 
the day before taking his degree and leaving for Paris in January 1845. Thomson was 
entranced. He took the Essay to Paris, and showed it to Joseph Liouville and his friend 
Charles Sturm. They too were excited by it, and so too was Crelle, who immediately 
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accepted it for publication in his journal, the leading mathematical journal of the day. 
It was published in three installments between 1850 and 1854. 

The combination of ideas, their elegance, and their clear presentation even when 
rigour lay out of reach, impressed mathematicians, and rapidly ensured that Green’s 
name was henceforth securely attached to his discoveries, even when some had by 
then been discovered by others. From Crelle, news of the essay passed to Dirichlet, 
the leading mathematical analyst of his day, and from him to Bernhard Riemann, and 
William Thomson remained a staunch advocate of them, as was Maxwell. 

The standard equations of mathematical physics can always be treated using 
Green’s functions. The reason is that the equations are linear, and so may be said to 
have a family of solutions, every one of which is a sum (in a suitable sense) of what 
may be called the basic solutions. If a basis of solutions can be found by solving 
extreme cases (such as concentrating the electricity at a point) then the method of 
Green’s functions will work, and this is usually the case. If the equations are not 
linear, their solutions may still fill out a cone, in which case the ones arising from 
the extremal problems will lie on the boundary of the cone. 


Gauss’s Theorem (After Feynman) 


Given a vector field V and a volume Q enclosed by a surface S, first consider an 
infinitesimal cube in the volume, of sides dx, dy, dz. The flow along the vector field 
out of the upper face of the cube is given by multiplying the vertical component of V, 
which we write V3(upper) by the area of the face, dxdy: V3(upper)dxdy. As for 
the lower face, the flow there is V3(lower)dxdy, so the flow through these faces is 
V3(upper)dxdy — V3(lower)dxdy. This is (V3(upper) — V3(lower))dxdy. Now, 
the difference (V3(upper) — V3(lower)) can be evaluated by taking the first term 
in the Taylor expansion: (V3)-dz. So the flux through these faces is (V3)-dxdydz. 
Therefore the flux through the cube (by looking at the two other pairs of faces) is 
((V1)x + (V2)y + (W3)z)dxdydz. This can be written as V.Vdv. On the other hand, 
the flow is also the sum n.V, the sum of the normal directions. 

We pass from the infinitesimal cube to the finite solid body by integration. The 
adjacent faces of neighbouring cubes contribute equal and opposite amounts to the 
sum, because what leaves one enters the other, and so the contribution here is entirely 
from the surface, and the result is ft y V.Vdv = if 5 n.V. Note that the RHS can be 
written as a normal derivative. 

It is worth pointing out that this informal argument has one interesting flaw: it is 
unclear how the infinitesimal cubes can fill out a body whose sides are not parallel to 
the coordinate planes. When we recall the familiar paradox in two dimensions that 
‘shows’ that the two shorter sides of a right-angles triangle have a length equal to the 
hypotenuse we can begin to see the need for infinitesimal solids with sides parallel 
to arbitrary planes. 

In this fallacious argument, the sides AB and BC of a right-angled triangle (see 
Fig. 13.1) are repeatedly bisected to produce a sequences of curves 
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Fig. 13.1 Two sides 
of a triangle successively 
approximating a third 


ABC, AB, A, BoC, AB3A2B4A1B5A3B6C,... 


then tends pointwise to the edge AC. Each curve has length AB+ BC and the lengths 
of these curves do not, of course, tend to the length AC. 


Chapter 14 
Dirichlet, Potential Theory, and Fourier 
Series 


i 
Gustav Lejeune Dirichlet (1805-1859) 


14.1 Introduction 


Dirichlet is often regarded as the man who, more than any other, brought rigour to 
mathematical analysis. Here we look at his career and then at two of his contributions: 
the Dirichlet principle in potential theory and his study of when a Fourier series 
converges to the function that it represents. 
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14.2 Dirichlet’s Career 


Gustav Peter Lejeune Dirichlet is one of the most important mathematicians of the 
19th Century. He was born in Diiren, near Cologne in Germany in 1805, which at 
the time was occupied by the French under Napoleon (it became part of Prussia 
after 1815). Finding that there was not much opportunity to learn advanced math- 
ematics where he was, at the age of 17 he went to Paris, for Paris was then by far 
the leading place for mathematics in the world. There he made many good social 
contacts, including Fourier, and picked up a strong interest in Fourier series and 
mathematical physics. However, his consuming interest was in number theory, and 
when he was 20 he presented a memoir to the French Academy of Sciences in 
which he showed that many cases of the equation x> + y> = z> cannot be solved 
in integers. The remaining cases were then dealt with by Legendre, but unhappily 
Legendre’s publication unduly minimised Dirichlet’s contribution. Even so, Dirich- 
let had almost completely resolved the first new case of Fermat’s Last Theorem 
since Euler had dealt with cubes. Dirichlet returned to Germany in 1826 where his 
acquaintance with Fourier brought him an introduction to Alexander von Humboldt, 
whose brother Wilhelm had founded the new University of Berlin in 1810, and in 
1828 Dirichlet was appointed a Professor of Mathematics there. In 1831 he married 
Rebecca Mendelssohn-Bartholdy, the sister of the famous composer; Dirichlet was 
a good pianist in his own right. 

Dirichlet went on to make many contributions to number theory, one of the most 
notable being his proof that every arithmetic progression of the form {a + bk |k = 
1,2, ...} where a and bare relatively prime contains infinitely many primes. Through 
his papers and his lectures, he explained Gauss’s work to the rest of the world 
so successfully that he helped make number theory into a major branch of pure 
mathematics, and almost the leading branch in Germany. This work led directly to 
the work of Richard Dedekind in the 1860s and beyond, and thence to Hilbert. 

More important here was Dirichlet’s attitude to mathematical rigour, captured 
marvellously in the remark his life-long friend Jacobi made in a letter to Alexander 
von Humboldt: 


If Gauss says he has proved something, it seems very probable to me; if Cauchy says so, it 
is about as likely as not; if Dirichlet says so, it is certain. I would gladly not get involved in 
such delicacies. ! 


And Dirichlet’s wife tells us that Jacobi would spend hours with Dirichlet 


Being silent about mathematics. They never spared each other, and Dirichlet often told him 
the bitterest truths, but Jacobi understood this well and he made his great mind bend before 
Dirichlet’s great character.” 


!Jacobi to A. von Humboldt, 21 December 1846, quoted in Biermann (1973, 53). 
2Quoted in Scharlau and Opolka (1985, 148). 
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It can be argued that Dirichlet. more than Cauchy or even Gauss, is the man who 
brought rigorous mathematics into the world. Two examples, one more successful 
than the other, will illustrate what that means. The first is his work on potential theory, 
the second his work on the convergence of Fourier series. 


14.3 Dirichlet and Potential Theory 


Green had claimed in 1833 that one can solve this problem: given a solid body and 
a continuous function V defined on its boundary, there is a function f which is 
harmonic on the interior of the body and agrees with V on the boundary of the body. 
His argument was as follows: the successful function minimises the integral 


Oy fOTyY of OEY 
a (51) . (5*) : (55) anaes 


Thomson made the same claim a few years later, in 1847, but the assertion has been 
attached for many years to the name of Dirichlet because of lectures he gave in 
1856-1857 at Géttingen that were published posthumously in an edition by Grube 
in 1876. 

According to Grube, what was known as ‘Dirichlet’s principle’ was the claim that 
there is always a unique function on an arbitrary bounded domain which, with its 
first partial derivatives, is continuous, satisfies Laplace’s equation everywhere in the 
domain, and reduces to a given value at every point of the boundary of the surface. 
So in Grube’s version the Dirichlet Principle was this bold existence and uniqueness 
theorem.? 

In the lectures, Dirichlet observed that the problem of finding such a function 
explicitly “cannot be solved; we can only speak of an existence proof for it. The 
latter presents no difficulty.” He now stated the ‘Dirichlet principle’: 


For every bounded connected domain T there are clearly infinitely many functions u contin- 
uous together with their first-order derivatives, for x, y, z which reduce to a given value on 
this surface. Among these functions there will be at least one which reduces the following 


integral U = f[,,., (gor : (ge 4 (3 y) extended over the domain T, to a minimum; 


Ox 
it is evident that this integral has a minimum since it cannot become negative. We can now 
show the following: 


1. Every such function u which minimizes U, satisfies the differential equation — + oe 
ru ' 


3,2 = 0 everywhere in the domain 7. This already makes it clear that there always exists a 
function u having the desired property, namely that function for which U becomes a mini- 
mum. 


3This remark, and the quote that follows, are taken from Dirichlet (1876, 127-128), quoted in 
Bottazzini, The Higher Calculus, 300. 
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2. Every function u which satisfies the [above, JJG] differential equation within the domain 
T, minimizes the integral U. 

3. The integral U can have only one minimum. It follows from 2 and 3 that there is only one 
function u with the desired property. 


In short, the ‘Dirichlet problem’ —find a function satisfying the above conditions— 
can be solved by an appeal to the “Dirichlet principle’. The principle provides a 
function that minimises a certain integral, and this function solves the problem. 

When this ‘principle’ is applied in a genuine physical setting it seems entirely 
reasonable: nature herself tells you there is an equilibrium distribution of charge, and 
so forth. But there is a question when the matter in hand is entirely mathematical. 
Proofs are required for each of the next three assertions: 


there are functions that extend the function defined on the surface to the whole of 
the domain in question; 

there are functions of this kind for which the integral is finite; and 

there are functions of this kind that minimise the integral. Unless these conditions 
can be met the class of functions Dirichlet wished us to consider is empty and the 
argument stops right there. When Dirichlet said “It is evident that this integral has 
a minimum since it cannot become negative”, the only proper reply is “Not exactly, 
Lord Copper”. The function f(x) = x defined on the open interval (0, 1) is never 
negative but it never has a minimum. The function F(x) = tanh(x) + 1 defined on 
the whole of R is never negative but it too never has a minimum. It is surprising, 
and a little disappointing, that Dirichlet would make such naive remarks, especially 
given the sophistication of his earlier work on Fourier series to which we turn next. 


The fate of the Dirichlet principle will detain us later, when we look first at the 
work of Dirichlet’s greatest student, Bernhard Riemann, and then at the criticisms 
his work engendered among his followers and in the rival school of complex function 
theorists, led by Karl Weierstrass in Berlin. 


14.4 Dirichlet and Fourier Series 


Dirichlet came to the topic of Fourier series knowing only, he said, of one paper by 
Cauchy on the subject. There had, in fact, been quite some discussion of Fourier’s 
ideas in print, and one can suppose that Dirichlet heard them discussed during his 
stay in Paris. For example, Poisson had commented on Fourier’s methods for finding 
the coefficients in a Fourier series as follows: “It seems to me that the formula for the 
expansion of functions in series has not in fact been demonstrated in a precise and 
rigorous manner”,* acomment that Bottazzini says seems to have been widely shared. 
Later, Poisson wrote (1835, 186) “the determination of the coefficients essentially 
supposes that one knows, in addition to the form of the functions, that they are 


expandable”—here I take it that it is the expandability of an arbitrary given function 


4See Poisson (1823a, 46), quoted in Bottazzini, The Higher Calculus 188, as is the next quotation. 
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that is at stake. Bottazzini notes that Sturm endorsed this shrewd observation: “Fourier 
and other geometers seem to have misunderstood the importance and the difficulty of 
this problem, which they have confused with that of determining the coefficients”.° 
The distinction here is worth noting. It might be that the coefficients could all be 
evaluated and the corresponding Fourier series shown to converge—does it follow 
that the Fourier series is equal to the function for every value of the variable (in the 
given interval)? 

Cauchy took up the subject in 1827, dismissed Poisson’s approach as unconvinc- 


ing, and wrote® 


In series of this type, the coefficients of the different terms are ordinarily definite integrals 
that include sines and cosines; and when the integrations can be made, by reason of the 
particular form attributed to the function that it is necessary to expand, it is easily seen that 
the series obtained are convergent. Nevertheless it is always desirable that this convergence 
be demonstrated in a general manner, independently of the values of the functions. 


Cauchy’s “proof” is interesting for a variety of reasons. It uses his new theory of 
complex integration and his theory of residues, it produces a convergence criterion 
that turns out to be flawed, but most of all because it gives us a vivid picture of a 
mathematician at work, pulling ideas from unexpected places and trying anything 
that might work. As Bottazzini points out (p. 190) the resulting paper looks more 
like experimental science than the logical hypothetico-deductive structures we expect 
meta-mathematics to be. Here I pass over the use of complex function theory to get 
to the convergence question. 

Cauchy wanted to investigate the convergence of a Fourier series of a function 
Ff (x) on the interval [0, a]. He took the series, and transformed it into another (using 
his complex methods) and then argued that the convergence question reduced to 
establishing the convergence of a series XU, by comparison with one whose general 
term is w,,. In particular, he claimed that if the series XU, w, converges, and wy, > Up 
as n tends to infinity, then Xv, also converges. (In the case at hand, Cauchy set 
Wn = —x(f (@) — f (0) sin(?2*)). 

Dirichlet began his paper (see the translation in Appendix A.2) by observing 
that the fact that a Fourier series expansion of an arbitrary function converges is 
remarkable (note that he did not doubt the convergence). But, he said, no-one had 
given a satisfactory proof, and so far as he knew only Cauchy had really tried, and that 
proof was flawed by fallacious convergence argument. Moreover, the use of complex 
methods seemed foreign to the topic at hand, and he noted that Cauchy elsewhere also 
spoke of the need for caution when making real functions into complex functions. 

As for the convergence ‘criterion’, that was refuted by this example. Let 


iy (ny (-1)" 
Wn = WF and vy, = Si (1+ er ) 


5See Sturm (1836, 400), quoted in Bottazzini, The Higher Calculus, 188. 
6Quoted in Bottazzini, The Higher Calculus, 188. 
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The first series is convergent, the second one is not because vy = Wy, + 1 so the 
difference of the second from the first is the well-known divergent series 1 + 5 + 


5 +--+. However, both wy, and v, tend to zero as n tends to infinity. 

Dirichlet therefore proposed to establish the convergence of a Fourier series 
directly, and also that the series and the function agreed. The Fourier series associated 
with a function ¢(x) on the interval [—7, 7] Dirichlet wrote as 


1 
Ae PL CORR DACs De ect Um COS Tn ys 


+ a, sinx +a2sin2x +---+dmsinmx+--- 


where the coefficients are given by 


1 Tv 

by = — o(a) cosmada, 
a7 

adn = — o(a) sinmada. 
—T 


The series is therefore composed of terms of the form 
cosmx cosma + sinmx sinma = cosm(a — x), 


by a standard trigonometry identity. The first n terms of the Fourier series can then 
be summed, and the sum is: 


2 d(a)da. 


ax. 


1 i sin(2n + 1)9>* 


WT Jax 2 sin 


So the summation of the Fourier series is established if and only if this integral, today 
known as a Dirichlet integral in Dirichlet’s honour, converges to a finite sum as n 
tends to infinity. 

Dirichlet first considered the case where the arbitrary function @(x) is continuous, 
positive, and monotonic decreasing in [0,h] with 0 < h < 5. He looked at the 


convergence of the integral Us Sinn DE aayd ( as n tends to infinity. The argument 
is delicate and original. The integrand is ‘infinite’ when the denominator vanishes, 
which happens when (7 = 0 and (outside this range of integration, when 3 = 2k7. 
The integrand changes sign whenever sin(2n + 1) vanishes, which happens much 
more often, whenever (2n + 1)( is an even integer, that is, whenever 0 < G = 
sit ; <1. For future use, let us temporarily call an interval any of the regions between 
the points where the integrand vanishes (and so where the integrand has a single sign). 


An aside Fig. 14.1, which is the graph of snttly) in the range [—7, 7], shows 


clearly that the function takes its maximum values at —7, 0 and 7, and otherwise 


14.4 Dirichlet and Fourier Series 149 


Fig. 14.1 The function 40 


sin@lx) in the range [—7, 7] 
sin x 


20 


10} 


Ly a HN i hee 
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contributes small alternating amounts to the integral fe GN ee Remarkably, 


: sin x 
i. sn) dx = 2r. 

Now, the absolute value of the integrand shrinks as one goes from each interval 
to the next, because the function (x) is monotonic decreasing. So the value of the 
integral is a finite sum of an alternating series of positive and negative terms that 
shrink in absolute value, and the question is to establish the convergence of this sum 
as the number of terms included increases without limit. By the mean value theorem 
applied to the function ¢(x) this alternating series is a sum of terms px times an 
integral of the form ise i) a wns d(3. Because the absolute values on each interval 
are shrinking, the absolute values of the p, are all less than the absolute value of the 


first one, p1. So the problem is reduces to the study of sums of terms of the form 
kr/n sin 
| om 0, 
(k-)r/n_ Sin GB 


As n tends to infinity the integral ie i ok ann? dG tends to 


ae sin 7 
———dy/n 
(k—1)n Siny/n 


sin 


—put 7 = Bn. Happily, ic = AY is finite, in fact equal to [—A non-trivial result 
first obtained by Euler. So the alternating series converges as n tends to infinity, and 
with it the integral in (. 

What then of the Fourier series? Dirichlet now understood the value of the integral 
in @ and therefore the integral in a — x, which I repeat here for convenience: 


2 
a)da. 
TJ, 2sin* o@) 


1 [ sin(2n + 1)%* 


His arguments apply to two parts of this integral, the integral from —7 to x and 
from x to 7, as can be seen from the substitution x — 23 = a in the first case and 
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x + 2(@ = a in the second case. Note that when you make these substitutions the 
variable x turns up in the upper end points of the integrals, which now run from 
0 to rie and from 0 to *;* respectively. The result of this is that the conclusions 
of the preceding arguments have to be modified somewhat, and they now give that 
the limit of the integral from —7z to 7 (and so the sum of the Fourier series) is 
5 (P(x +) — d(x — €)) where ¢ is an infinitesimal. So, if ¢ is continuous, the limit 
is H(x). 

What class of functions has Dirichlet shown to have convergent Fourier series? 
Those functions ¢ for which (x) is continuous, positive, and monotonic decreasing. 
Dirichlet now argued that for more general functions ¢ one can consider the different 
intervals on which the new given function is continuous, positive, and monotonic 
decreasing or increasing. The original argument works in each of these settings, and so 
convergence is again assured on the whole interval—provided that the decomposition 
of the function ¢ into monotonic regions requires only finitely many intervals. For 
the same reason, the condition that the function ¢ be continuous everywhere can be 
dropped to the extent that the function may be allowed to jump at finitely many points 
(which produce a further decomposition of the original interval). Dirichlet spoke of 
the function having sudden changes at isolated points. 

The convergence of the Fourier series was therefore established by Dirichlet for 
functions (on the interval [—7r, 7]) with finitely many points where they have jump 
discontinuities and which have a finite number of maxima and minima. The value of 
the Fourier series at the jump was shown to be the average of its values, one might 
say, on either side of the jump. 

Dirichlet then added a famous remark. The necessity of imposing the condition 
that the function have only finitely many jumps was felt, he said, by contemplating 
a function @(x) that takes one value c when x is rational and another value d when 
x is irrational. This strange function, never previously considered in mathematics, 
was introduced to show the value of a certain restriction needed for the proof of the 
theorem. It could be, of course, that the restriction could be partly lifted; Dirichlet’s 
function showed however that some restriction was necessary. 

Dirichlet’s proof had also imposed some other conditions on the functions that 
could be shown to agree with their Fourier series. They must have only finitely many 
maxima and minima in any interval. Dirichlet noted that this condition too required 
both comment and further investigation, and expressed the hope that he would return 
to the topic in a later note, but he was never to do so. There is some evidence that he 
was optimistic on this point, but if so he was to be proved wrong in 1876, long after 
his death, as we shall see. 

Dirichlet’s work shows a shift of concern, from the description of expected prop- 
erties (‘every function has a convergent Fourier series to which it is equal’) to the 
perceived need to establish these properties together with the observation that they 
cannot be expected to hold in general. 


A Trigonometric Argument 


We want to evaluate 5 + Mie cos(kt), where t = x — a. Consider 
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1+ >° (cos(kt) +i sin(kt)) = 1+ Doe 


k=1 k=1 

_ eit Dt eT in+)t/2 _ el (at )t/2 ei(n+)t/2 
aa ae" eit/2 _ git/2 eit/2 
_ sin + Dt/2) pinta 

sin(t /2) 

i 1)t/2 

= eee (cos(nt /2) + i sin(nt/2)). 
So 

a. sin((n + 1)t/2) i 
= kt) = ———___— t/2)-——. 
5 pe ) sin(t/2) cos(nt /2) 5 


To bring this into line with the neater expression that Dirichlet used, we write 


sin((n + 1)t/2) 1 sin((n + 1)t/2) cos(nt/2) — 5 sin(t/2) 
————— cos(nt /2) — : . 
sin(t/2) 2 sin(t /2) 


Then we use the trigonometric identity 2 sin A cos B = sin(A + B) +sin(A + B) to 
rewrite this as 
5 sin((2n + 1)t/2) + 5 sin(t/2) — 5 sin(t/2) _ sin((2n + 1)t/2) 
sin(t/2) ~  2Qsin(t/2) 


’ 


which is the expression Dirichlet used. 


Chapter 15 
Riemann 


Bernhard Riemann (1826 —1866) 


15.1 Introduction 


This is the first of four chapters in which we look at various aspects of Riemann’s work 
and the way it changed the theory of real and complex functions. In the short space of 
fifteen years—1851 to 1866—Riemann rewrote both subjects as he did differential 
geometry and analytic number theory. Here we look at how his introduction of 
trigonometric series opened the way for mathematicians to study the properties of 
non-differentiable functions. 
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15.2 Lipschitz 


Among those who responded to Dirichlet’s paper on Fourier series was Rudolf Lip- 
schitz in his doctoral dissertation (Lipschitz 1864). Lipschitz considered the ways 
in which a function could fail to satisfy the sufficient conditions that Dirichlet had 
established for the convergence of a Fourier series. For example, the function f might 
be discontinuous at an infinite number of points, although otherwise monotonic and 
piecewise continuous in the interval [—7r, 7r]. But the difficulty here was that Dirich- 
let’s concept of the integral did not apply in this case, and so Lipschitz wondered 
if the integral could be suitably redefined. He came to the conclusion that it could, 
at least if the set of discontinuities was nowhere dense.! Here his argument was 
incorrect, but this was not noticed for a time, which shows how difficult this question 
really is. 

Lipschitz’s fallacious argument was that if D, the set of discontinuities, was 
nowhere dense, then D’, the set of limit points of D, must be finite. In this case 
an argument about singular integrals can be mounted that extends Dirichlet’s theo- 
rem appropriately. For example, if D’ consists of the single point c, then f has at most 
finitely many points of discontinuity in any interval (—7, c — €) and in (c+ €, 7), 
for any €¢ > 0. We shall recall later the significance of looking topologically at the 
point set D and not in a way that illuminated the concept of integrability. 

Once he was past this mistake, Lipschitz then argued correctly that Dirichlet’s 
monotonicity condition could be replaced by a condition on the function that is 
stronger than piecewise continuity but weaker than differentiability—the original 
Lipschitz condition. 


15.3 Riemann 


Riemann was the archetype of the shy mathematician, not much drawn to topics other 
than mathematics, physics, and philosophy, devout in his religion, conventional in 
his tastes, close to his family and awkward outside them.” As a child, he was taught 
by his father, a pastor, and then for some years at school before going to Gottingen 
University. There he had initially intended to study theology, in accordance with his 
father’s wishes—G6ttingen was the only university in Riemann’s native Hanover 
with strong links to the Hanover church—but his remarkable ability at mathematics 
led him to switch subjects. He was always inclined to the conceptual side of things, 
rather than the computational or algorithmic, and interestingly enough never learned 
to write his native German eloquently, and his Latin (required for academic purposes) 
was even worse. 


'Tn modern terms, a set A is nowhere dense in another set B if the closure of the interior of A is 
empty. We shall see that this concept was not reliably understood in the period before it was made 
precise by the techniques of 20th century topology. 


?For a biography of Riemann, see Laugwitz, Bernhard Riemann (2000). 
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Although Gauss was in Gottingen, where he taught statistics, the level of instruc- 
tion at Gottingen was not high, and Riemann transferred to Berlin in 1847; German 
students were allowed to study at any university. There he found his ideal instructor 
and guide in Dirichlet, from whom he learned about potential theory and partial dif- 
ferential equations, number theory and theory of integration. He also took Jacobi’s 
classes in analytical mechanics and higher algebra, and it has been suggested that 
these lectures might have stimulated Riemann’s tendency to think in an abstract and 
sophisticated way about the relation of mathematics to physics and the real world.* 

Riemann returned to Gottingen in 1849 and attended Wilhelm Weber’s lectures 
of mathematical physics and for a while he devoted himself to studies in physics and 
Naturphilosophie. He met Richard Dedekind, who was greatly influenced by him 
throughout his long mathematical career, and in 1855 Dirichlet came to Gottingen as 
Gauss’s successor. Dirichlet was a major influence not only on Riemann’s work on 
trigonometric series and number theory, but also on his general, abstract approach. 
However, Dirichlet died in 1859. Riemann was quickly appointed his successor, but 
as it was to turn out he had by then published nearly all of his major papers, for 
he contracted pleurisy in 1862. The only treatment was to spend as much time as 
possible in the South, and whenever he could he travelled to Italy, where on 20 July 
1866 he died near Lake Maggiore, where he is buried. His friends arranged for his 
published papers, several unpublished ones that were found to be in a good enough 
state, and some that could be edited from notes, to be published, and the first edition 
of his Werke (edited by Dedekind and Heinrich Weber) appeared in 1876. 

No-one disputed Riemann’s brilliance, but he had few students, and among them 
Hankel and Roch died young, and Hattendorff at 48. His successor at Gottingen was 
Alfred Clebsch, who was joined by Ernst Schering, who was not a truly creative 
mathematician. Clebsch stepped forward to take up Riemann’s ideas, but died, aged 
39, in 1872, and eventually Felix Klein took up the task, but by then Berlin had become 
the dominant centre for mathematics in Germany, and Weierstrass and Kronecker 
much preferred algebra to geometry, let alone topology, and Riemann’s deepest 
ideas fell into suspicion. His preference for highly abstract arguments over long 
calculations and explicit methods was alien to them, and rather than make Riemann’s 
insights rigorous on, as it were, Riemann’s own terms Weierstrass and his former 
student Hermann Amandus Schwarz frequently sought to rewrite them in their own, 
very different, manner. 


15.4 Riemann’s Publications 


The first of Riemann’s publications was his doctoral thesis on the foundations of a 
theory of functions of a complex variable. In it he presented the idea of a ‘Riemann 
surface’ as a branched covering of the complex plane, and outlined a proof that 
any two disc-shaped regions of the plane are not only topologically equivalent but 


3See Pulte in Jacobi (1996). 
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analytically so (there is a 1-1 analytic map from each one to the other). The thesis 
was privately distributed, and had no impact until it was reprinted 25 years later in 
his Collected Works. 

Riemann could now work for the crucial post-doctoral qualification the Habil- 
itation (a necessary and sufficient condition to be allowed to lecture at a German 
University). In 1854 he successfully presented his Habilitation essay, on trigono- 
metric series and the theory of functions, and gave his Habilitation lecture, on the 
foundations of geometry. Whole branches of mathematics stem from each of these. 

In the lecture he argued that mathematics is about ‘n-fold extended quantities’ 
(‘n-tuples of real numbers’ is the equally unattractive modern term) to which is added 
some appropriate extra structure, such as a metric to measure distance. This view of 
differential geometry gradually allowed mathematicians to propose many different 
descriptions of physical space, because any set of n-fold extended quantities with a 
metric will do, and as a result Euclidean geometry became just one possibility among 
many. In particular, complex numbers are pairs of real numbers; some extra structure 
(given by the Cauchy—Riemann equations) enables one to define analytic functions. 


15.5 Riemann’s Paper on Trigonometric Series 


One of the papers Riemann wrote for his Habilitation in 1854 was undoubtedly the 
one he wrote under the influence of Dirichlet.’ Its subject is the representation of a 
function by its Fourier series. It opens with a remarkably thorough account of the 
history of the topic to date, going back as far as Euler and d’ Alembert, and which also 
contains information obtained directly from Dirichlet himself. This account forms the 
kernel of every historical account written since. Turning to the mathematics, Riemann 
conceded that functions that do not satisfy Dirichlet’s conditions are unlikely to 
occur in nature, but he still felt that they were worth studying for two reasons (see 
Riemann 1867a, §3). One was that Dirichlet had argued that “the topic has a very 
close connection with the principles of infinitesimal calculus, and can serve to bring 
greater clarity and rigour to these principles.”. The other was that Fourier series were 
useful in pure mathematics, including number theory, where precisely the sorts of 
functions that Dirichlet had excluded were important. 

Riemann began his investigation by redefining the Cauchy integral (see the extract 
in Appendix A.4). Cauchy had considered only those subdivisions of an interval into 
subintervals of equal size. Riemann proposed to let the intervals vary, and said that 
a function f on a given interval was integrable if the area given by the obvious 
rectangles tended to a limit as the widths of the intervals in the partition tended 
to zero. This is not much of a departure from Cauchy’s idea. But then Riemann 
argued that if the oscillation of the function on each interval is also bounded and the 


4The entire paper has been translated into English and published in Bernhard Riemann, Collected 
papers, translated by Roger Baker, Charles Christenson and Henry Orde, Kendrick Press, 2004; see 
pp. 219-256. 
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contributions of the ‘tops’ of the rectangles is bounded as the intervals tend to zero, 
then the function will still be integrable. His idea was that each partition presents 
an approximation to the function as a step function. If the successive step functions 
do not differ uncontrollably as the intervals shrink, then the function should have an 
area. 

So Riemann defined integrability without reference to continuity. Next he turned 
his attention to functions with infinitely many points of discontinuity. As he put it 
(1867a, $6) “Since these functions have never been considered before, it will be good 
to give a definite example”. He first defined the function r(x) as follows.> For each 
real number x, 

Wars x — m(x) if x ~n/2 
0 ifx =n/2 


where m(x) is the integer such that |x — m(x)| is a minimum, and n is odd. So the 
function r(x) takes values between —1/2 and 1/2, and is zero where x = n/2 with 
n an odd integer. Its graph is discussed and illustrated in Sect. 15.6. 

Riemann then defined the function 


2 
FO) =r) OP pe 


This function is discontinuous at every point x = m/2n where m and 2n are relatively 
prime; this is a dense set of rational numbers. At these points, the right and left hand 
limits of the function f are 


_ 1 1 7 T 
faH=f@)-s5(>, Gran? |=! jee 


k 


and 


_ I 1 _ me 
fO-) =f) + 55 2d Ora? |= + Tee: 


Nonetheless, the function f is integrable, because the contribution of jumps exceeds 
a given bound at only finitely many points. 

A good way to think of Riemann’s fundamental question about integrals, which 
led him to redefine the Cauchy integral, is to think of the integral [ i f (@)dt as x goes 
from a to, say, b as calculating a moving average. To be precise, by the intermediate 
value theorem, A Ff (t)dt = (x —a)g(x), where g(x) is the average value of f(x) on 
the interval (a, x). So Riemann was looking for functions which may oscillate wildly 
and even be discontinuous, but which always have an average value at every point of 
an interval on which they are defined. From this perspective it is not only reasonable 
that an integrable function can fail to be defined at some points (the average of 


5Riemann’s notation for this function was (x), but this causes problems by disappearing when the 
function is composed with others, so I have introduced this notation. 


158 15 Riemann 


infinitely many values should not be affected if we lose finitely many values) it is 
quite plausible that many points of discontinuity and some degree of oscillation in 
the values of the function should not affect its integrability. In the example of the 
case just considered each jump affects the average value by a finite amount, but the 
function is integrable because the contributions of the jumps are negligible except at 
finitely many points. 

Commentators agree that Riemann’s deepest insight came with his radically new 
point of departure. He began with a trigonometric series, which he wrote 


(x) = Ao + Aix) + Ar(x) +--+ + An@)+---, (15.1) 


where 
Ap = do/2, An(x) = ad, cosnx + by, sinnx. 


In §9 he assumed that A, (x) converged to 0 as n increased—in fact, he treated this 
as uniform convergence throughout the paper: the convergence is for all values of x 
under consideration. Next, he formally integrated the series (2 (x) twice to obtain the 
series 


An(x) 


n2 


F(x) =C+C'x + Aox’/2- >> (15.2) 


The hypothesis on the coefficients A,,(x) ensure that the series F(x) converges for 
every value of x and represents a continuous function of x. In fact, the function F (x) 
has the important properties that (Lemma 1): 


DF) = tim PEtO+A—F@+a- 8) ~ Fe - ath) + F@-a-s) 
a,B>0 AaB 


exists and equals &2(x) whenever 2 converges. 
Also (Lemma 2): 


. F(x+ta)—2F(x)+ F(x -—a) 
lim = 


a>0 a 


0 (15.3) 


for all x. 

Riemann then proved the important result (Lemma 3): today known as Riemann’s 
Lemma. 

Let b < c be arbitrary real numbers, A(x) a function with continuous first dif- 
ferential quotient within [b, c], and which vanishes at b and c, and whose second 
differential quotient has only finitely many maxima and minima in [b, c]. Then 


lim vf F(x) cos u(x — a)A(x)dx = 0. 
E> 00 b 
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The proof is astute manipulation of series, rather than difficult. 

Using these results, in §9 Riemann derived necessary and sufficient conditions 
for a function to be represented at a point by a trigonometric series (2. These were 
that 


1. Fora function f to be representable on [0, 27] by a Fourier series, there must be 
a continuous function F, derived from f by double formal integration, such that 
Lemmas | and 3 hold. 

2. Conversely, if these conditions hold, f is representable by the Fourier series 
everywhere the Fourier series converges. 


The proofs of these conditions made heavy use of the Dirichlet integral, and will 
not be discussed here. Nor were they found to be convincing, although they are 
correct: Smith and du Bois-Reymond both disputed the sufficiency. 

In §10 Riemann made the remarkable observation that if the coefficients of Q 
become infinitely small then the convergence at a value of x depends only on the 
immediate neighbourhood of x. This is in marked contrast with the idea that the 
series is defined on a whole interval, and if it were a Fourier series its coefficients 
would be determined as integrals over that interval. But it is absolutely in line with 
Dirichlet’s observation in the much simpler case that he had considered. 

Thus far, Riemann has assumed that the coefficients of the trigonometric series 
F(x), Eq. (15.1), all become arbitrarily small for large enough n whatever the value 
of x. He now started to weaken this assumption—I omit a discussion of this section. 

Thus far he had largely considered functions of the kind that Dirichlet had looked 
at. In §12 Riemann began a study of the representation of functions which become 
infinite at finitely many points but have only finitely many maxima and minima, by 
trigonometric series whose terms become arbitrarily small. Then he considered the 
representation of functions which have only finitely many maxima and minima, by 
series whose terms do not become infinitely small. 

Then in §13 he opened the door to all sorts of unexpected, and largely unknown, 
functions. A list of his results must suffice: 


1. Functions with infinitely many maxima and minima can be integrable without 
being representable by a Fourier series. For example, 


d(x” cos(1/x)) 
dx 


on 0 < x < 27, with 0 < y < 1/2. His proof involves showing that the integral 
that would define the Fourier coefficients produces a quantity that does not satisfy 
the conditions laid down in the above lemmas. 

2. Conversely, there are functions which cannot be integrated but can be represented 
by a trigonometric series. For example, the function 


S r(kx) 
pier 


k=1 
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exists for every rational value of x and is representable by 
3 s(x) 
—— sin(2n7x), 
om 


where s(x) := >> of — (—1)¢ and is taken positive or negative according as n is 
odd or even, and a ranges over all the divisors of n. This series is not bounded in 
any interval, no matter how small, and consequently is not integrable. 
Another example is 

Len COS nx, 


where cy, — 0 but Sc, = oo. As Riemann explained, “For, if x /27 is rational 
with denominator m in lowest terms, then clearly the series either converges or 
diverges to infinity according as 


m—1 m—1 


> cosn?x, > sinn?x 
n=0 n=0 


are zero or not. Both cases arise, by a well-known theorem on partitioning the 
circle, for infinitely many values of x between any two bounds, no matter how 
close.” (The reference was to Gauss, Disquisitiones Arithmeticae §356.) 

3. Likewise the trigonometric series (2 can converge without the series obtained 
from it by term by term integration being integrable on any interval, however 


small. For example 
1 —log(i — q”) 
zh 0-eme(M=?), 
where the logarithms are chosen so as to vanish for g = 0. By Riemann’s lemma, 
this gives an example of a function representable by a trigonometric series that is 
not a Fourier series. 


4. Finally a trigonometric series can converge for infinitely many x in any interval 
and yet the coefficients not tend to zero. For example 


x sin(n!7x), 


which converges not only for every rational value of x but also some irrational 
numbers, such as sin 1, cos 1, 2 and its multiples, etc. 


Figure 15.1 shows the first four terms of Riemann’s series (4), which converges 
at some points even though the coefficients do not tend to zero. 
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Fig. 15.1 The first four 
terms of “7° sin(n!x) 


Summary 


Riemann’s key insight was the introduction of the trigonometric series. No assump- 
tion was made about the coefficients a, and b,, in particular they need not be the 
coefficients of a Fourier series. This distinction between a Fourier series and a trigono- 
metric series was introduced by Riemann, and he gave examples of trigonometric 
series that are not Fourier series. 

Riemann took some steps towards investigating functions that do not satisfy 
Dirichlet’s conditions for representation by their Fourier series, but historians agree 
that he found more problems than he could solve. He was not to be the first! But in 
his case it may have inclined him to keep the paper back from publication, so it was 
only published for the first time in 1867, after his death the previous year, and only 
then did it begin to influence the theory of real functions and their Fourier series. 

Strange functions, however, continued to occupy him, and Weierstrass was later to 
recall that he first heard in 1861 or even earlier from students who had attended Rie- 
mann’s lectures in Gottingen that the function represented by the trigonometric series 


Si 2y . . . . . * . . 
ye sn) is a continuous function that is nowhere differentiable. This statement is 
not quite accurate mathematically—I shall correct it in due course (see p. 204). 


15.6 Riemann’s Function 


Riemann’s strange function is very nearly the same as the function g(x) = x — 1 — 
L(x — 1/2)]. Both have period | and the same graph except that Riemann’s function 
vanishes at the points n/2 where n is an odd integer and the new function g takes the 
value —1/2 there. For graphing purposes these differences are insignificant, because 
we may change the value of a function at finitely many points without changing its 
integral. But these differences need to be remembered when taking an infinite sum 
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Fig. 15.2. Riemann’s J)" 
function g(x) ra 


-1 -08 -06 _-04 -02 0.2 0.4 0.6 0.8 1 
x 
0.27] 


Fig. 15.3. The first 15 terms 0.64 
of the sum defining 
Riemann’s function 


-1 08 -06 |-04  -02 70 02 04/06 08 
24 ¥ 


(which generates an infinite set of discrepancies). We can study finite approximations 
to Riemann’s function 


as if they are 


The function g(x) has a graph like this (see Fig. 15.2; the verticals are an artefact 
of Maple and are incorrect mathematically—the function has a jump discontinuity 
at those points). 

Figure 15.3 shows the first 15 terms of the sum defining Riemann’s function. 

The next graph, Fig. 15.4, shows the difference between the first 15 and the first 5 
terms of the sum defining Riemann’s function. It shows how crinkly the successive 
graphs are becoming. 
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Fig. 15.4 The crinkliness of 
terms 6 to 15 of the sum 
defining Riemann’s function 


15.7 Exercises 


1. What value could be attached to unrigorous mathematics in the 1840s? Give 
examples of mathematics showing various levels of rigour. 

2. Does mathematics have to be more rigorous than physicists require? 

3. How can a continuous function fail to be differentiable? Can you give any exam- 
ples of a function that is continuous on an interval but fails to be differentiable at 
infinitely many points? What properties would a function have to have for it to 
be continuous on an interval and differentiable nowhere? 


Chapter 16 
Riemann and Complex Function Theory 


16.1 Introduction 


In his doctoral dissertation of 1851 Riemann gave a complete, clear introduction to 
complex function theory as an autonomous domain in mathematics. He gave the sub- 
ject a clear starting point in the Cauchy—Riemann equations, brought in geometrical 
ideas that he may have learned from Gauss, and developed a profound connection 
with the theory of harmonic functions. He presented a theory, with uncertain levels 
of rigour, that applied not only complex-valued functions defined on the plane of 
complex numbers but to complex-valued functions on any two-dimensional domain, 
and outlined a proof that all simply-connected domains can be mapped one onto 
another by complex analytic maps. For a translation of the start of this paper, see 
Appendix A.3. 


16.2 The Doctoral Dissertation of 1851 


Riemann saw with remarkable clarity was that what was required of a function f of 
x and y for it to be a complex-valued function of a complex variable z = x + iy 
was that it could be treated as a function of a single variable, specifically, that it be a 
differentiable function of z. This requires that the expression af make sense, which 
means that the expression : 


ki f(z+4)— f(z) 
im ———-R-——_ 
530 6 


must be independent of the direction in which 6 tends to zero. For, if there were to be 
a sense of direction for 6 it would have to be in a space of more than one dimension, 
contradicting the basic idea that the variable z is a single thing and not a compound 
thing. He wrote 
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f@=fatiy)=u+iv=w and 6=dx+idy, 
supposed that dx + idy = ce*', and wrote down the corresponding expression for 


du +idv 
dx +idy’ 


du+idv 1 (Ou = Ov 4. 1.fdOv Ou 
= L 
dx+idy 2\0x Oy 2 \oOx = Oy 


a (> =) +i(2+2)) 209i 
~{{({— -— i{—+— ; 
2\\Ox Oy Ox Oy : 


For this to be independent of direction, it must be that the term in y vanishes, so 


namely 


Ou Ov Ov Ou 
Sa d —=-—. 16.1 
Ox Oy i Ox Oy re)) 


These are the celebrated Cauchy—Riemann equations, and Riemann rightly took them 
as fundamental. In § 4 of his paper he rederived them by writing 


(3 +i9t) dx + (3 — i$) idy 
dx + idy 


and observing that this expression takes the same value for all values of dx and dy 
when Eq. (16.1) hold. He now added that the functions u and v satisfy 


Pu Pu é Pv Hv r ies 
oe age te oe oe.) 
He did not call them harmonic functions, but he appreciated very keenly the connec- 
tion to potential theory. 

Riemann also observed that if the derivative does not vanish then the map is 
conformal. He may well have picked this idea up from a conversation with Gauss, 
or by reading and appreciating Gauss’s work on conformal maps (see Gauss 1822); 
in any case it was not an idea that Cauchy had used. 

To handle the case of a surface spread out over another so that several points on 
the upper surface correspond to just one point below, Riemann explained what a 
branch point is, using the example of the function w = z’”, which has a branch point 
of order m when z = 0. Here there are m values of w for every nonzero value of z, 
and they are cycled around as z is taken on a circle around the origin. 

Riemann deduced something else from his definition of a complex function: it 
need only be defined on a two-dimensional patch. It might be defined on only a 
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part of the complex plane, or perhaps on some other surface altogether. In this way 
he freed the complex variable to roam over a large class of surfaces; his theory of 
complex functions is not about functions defined on the plane of complex numbers; 
rather, it is about them and their generalisations defined on any two-dimensional 
patch whatever. 

To study a complicated surface he outlined a procedure whereby it could be cut 
into manageable pieces. He said a piece of surface was connected when any two 
points in it can be joined by a curve.! He defined a boundary cut (Querschnitte 
in German) as a simple (i.e. non self-intersecting) curve joining two points on the 
boundary. At every stage existing cuts were counted as parts of the boundary, and to 
get started, if the surface had no boundary at all, one was allowed to delete a point. 
The cutting process would end if any further cut divided the surface into two pieces; 
such surfaces he called simply connected. 

To investigate how the nature of the surface was related to the number of cuts 
needed to make it simply connected, he considered a surface T cut up in two separate 
ways. He supposed that it fell into m, simply connected pieces via a system of 
boundary cuts, and into m2 simply connected piece after m2 boundary cuts, and 
looked at the result of superimposing one system of cuts upon the other. By counting 
carefully he concluded that ny — m, =n» — myo. It follows that if a surface is cut 
into m simply connected pieces by n boundary cuts, then the number n — m is a 
constant, and Riemann called this number the order of connectivity of the surface. 
On this definition, a surface given in one piece is n-fold connected if a system of n — 1 
boundary cuts is required to cut it into one simply connected piece. The term ‘genus’ 
was introduced later in Clebsch (1864): a surface is of genus g if it is 2g + 1-fold 
connected. 


16.3 Harmonic Functions 


Riemann’s very general idea of the possible domains of a complex function made 
it difficult for him to define them by power series expansions, but not impossible. 
This was, indeed, to be Weierstrass’s approach later, but Riemann did not want 
to define complex functions by their representations as a power series. Rather, he 
wanted to define them abstractly in a way that should minimise the otherwise lengthy 
manipulations that would be involved. His preferred way to obtain existence theorems 
for complex functions was to exploit the fact that their real and imaginary parts are 
harmonic. The crucial theorem relates the integral of a function taken over a region 
and the integral of another function taken over the boundary of that region. It was 
known already to Cauchy, Green, Gauss and Dirichlet, and it says: 


OX OY 
; (> 4 =) dT = | (Xcos€ + Ycosn)ds (16.3) 
Ox Oy 


'This is the modern definition of path connected. 
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where the first integral is taken over the surface and the other along the boundary, 
and the angles € and 7) are the angles the normal makes with the x- and y- axes 
respectively. 

Using this theorem, Riemann obtained these essential results: 


e aharmonic function defined on a surface T that covers the plane once possesses 
all its derivatives. ; 

e if both the harmonic function uv and its normal derivative oe vanish along a curve, 
then uw vanishes everywhere, and so if the values of u and its normal derivative are 
prescribed along a curve they are determined everywhere. 

e aharmonic function uv cannot be constant on a piece of surface, and cannot attain 


a maximum or a minimum at a point inside T. 


To obtain these results, Riemann first proved the theorem (16.3) and then applied 
it in the special case when the surface integral vanishes and the surface T has been 
cut into a simply connected surface T*. The theorem now says that the integral along 
the boundary is a single-valued function of its upper endpoint (the lower endpoint 
being assumed fixed) provided that the upper endpoint does not cross the boundary 
cuts. If the upper endpoint moves across a cut, then the value of the integral increases 
by a constant quantity that depends on the cut in question but not on the position 
where the cut is crossed”), and each cut determines an independent quantity. 

Now, to apply this result to complex functions, Riemann had to allow that there 
will be points where the function is infinite. He extended his analysis to deal with 
functions that are harmonic except at a finite set of points, and showed that such a 
function u possesses all its derivatives. To do this he estimated the contribution of 
the ‘bad’ points to the value of the integral along the boundary, and hence deduced 
the existence of all the sought-for derivatives. The other results followed swiftly. 

Riemann now turned to the study of a complex function w = u + iv defined 
on a simply connected region that covers (a part of) the plane just once. Since the 
real and imaginary parts have now been shown to be infinitely differentiable, the 
same is true of the complex function w. To study the case where a function becomes 
infinite at some interior point z’ of its domain of definition, Riemann introduced polar 
coordinates centred on the singular point. If the function behaves like the wth power 
of p, where p is the radius, Riemann let m be the least integer greater than or equal to 
yu. Then (z — z’)” w tends to zero with p, and so (z — Ze w is a function of z. He 
denoted its value at z’ by aj,—1, and by repeating the argument m — 2 times showed 
that the function w can be changed into a finite, continuous one by subtracting an 
expression of the form 


a\ a2 aAm-1 
fe ia as rym—1" 
z-z (z—2/) (z — 2’) 


So, as he remarked, a function that becomes infinite to some finite order does so to 
an integral order. 


>To see this, apply the Cauchy integral theorem to the loop obtained from two different crossings; 
see § 17.4 below. 
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16.4 Riemann’s Defence of the Dirichlet Principle 


Riemann did not mention the Dirichlet principle by name in his paper of 1851, and 
moreover he did not uncritically rely on the principle but attempted to prove it. As 
he explained in next general paper on complex function theory, the monumental 
paper on Abelian functions (1857a), he was concerned in his (1851) to generalise 
this principle to cases where prescribed discontinuities are allowed. 

He argued as follows in his (1851, § 16). He defined an integral over a surface T 


of the form 
_ f (da apy? (da aby’ 


It is required to be finite for arbitrary functions a(x, y) and (x, y); we note that this 
integral will vanish if the functions a and (3 satisfy the Cauchy—Riemann equations. 
Riemann then argued that by varying a by a continuous function that is at worst 
discontinuous only at single points and that is zero everywhere on the boundary of 
T, the integral can be made to attain a minimal value. Once that is established—and 
this is the contentious part—then it is straight-forward to show that the minimising 
function is a unique function (apart from any isolated the points of discontinuity), 
and moreover that this unique minimizing function is harmonic. 

To prove the first claim, Riemann considered a function \ that vanishes on the 
boundary, that could be discontinuous at isolated points, and for which the integral 


w-(@)+@))e 


is finite. He let a + X = w, and considered the integral 


du aB\? (dw aBy’ 
—_—- = — +—) aT. 
ie 3) “Gs ¥ 9) 
which he denoted &2. He then asserted (1851, § 16) that 


The totality of these functions represents a connected domain closed in itself, in which each 
function can be transformed continuously into every other, and a function cannot approach 
indefinitely closely to one which is discontinuous along a curve without L(A) becoming 
infinite. 


So for each A, Q only becomes infinite with L, which depends continuously on \ 
and can never be less than zero; consequently has at least one minimum. 

This argument is too naive to pass as a proof, but it is an attempt. It proceeds by 
generalising the fact that a continuous function defined on a closed interval is bounded 
and attains its bounds to the situation of a function L defined and continuous on a 
‘closed’ set of functions, and claims that in this situation the function L likewise 
attains its lower bound. It even contains an argument that nothing goes wrong in the 
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passage to the limit. If it is to be rejected, the grounds are that the proof is too weak, 
not that the claim was simply accepted. 

It is interesting to note that when Riemann’s student Gustav Roch published his 
account of Riemann’s lectures of 1861/62 (Roch 1863, 1865) he avoided mention 
of Dirichlet’s principle here, although Riemann had mentioned it the lecture course 
for 1855/56 which his friend Dedekind attended.* Roch’s account does go on to 
include the Riemann mapping theorem.* So did repent of its proof? According to 
Klein (see Klein 1894-95, 492), Weierstrass once told him that “Riemann had never 
laid any particular value on finding his existence proofs with Dirichlet’s principle”, 
so perhaps he did. But perhaps he merely withdrew it as being unsuitable for an 
elementary lecture course. 

The two remaining claims follow less problematically. Uniqueness followed an 
examination of functions of the form u + h near to a minimum of u, which shows 
that Q can be written in the form M + 2Nh + Lh. Then, for u to be a minimum 
it must be the case that N = 0. For the same reason, any two minimising functions 
u must give the same value of M and can only differ by having different isolated 
points of discontinuity that do not alter the value of the integral. These arguments 
depended on showing that the limit function cannot be discontinuous along a curve, 
and Riemann established this in § 17. 

Then in § 18 Riemann applied a Green’s function argument to the integral for N to 
show that the vanishing of NV implied that the minimising function uw was harmonic. 

Riemann then deduced that if a complex function is defined on a connected surface 
T for which the integral 


_ [ (da AaB? (da  aB\* 


is finite, and if T can be cut up into a simply-connected region T*, then there is 
always a unique function jz + vi of z such that jz vanishes at all but finitely many 
points on the boundary, v takes an arbitrary value at an arbitrary point, and isolated 
discontinuities of yin T or v in T* leave L(j) and L(v) finite and the values of v 
differ only by a constant at corresponding points on the sides of the boundary cuts. 

This result brings us to the heart of Riemann’s conceptual approach to complex 
function theory and to the split that was to develop in the subject because Weierstrass 
could not accept it at all. In Riemann’s opinion (see Riemann 1851, § 19): 


[These] principles open the way to the study of definite functions of a complex variable 
independent of an expression for it 


With these words Riemann offered an approach to the study of functions that did 
not depend on any particular expression for it, and accordingly played down the role 
of long complicated manipulations of such expressions (such as Jacobi or Kummer 
had developed). It moves the study of complex functions firmly over to the study of 


3See Gottingen, Akt Nr. 37. 
4See Roch (1863, 19-28). Roch’s lectures are considered in Sect. 17.2. 
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harmonic functions, which are determined by their boundary values, but now in a 
context where functions may have some singular points. 

At this time, and arguably for many years afterwards, it was usual to allow complex 
functions to be many-valued, as the functions that take the nth root or the logarithm 
of a complex variable are. Riemann dealt with this by extending the domain of 
such a function to region T that is multiply connected and then passing to a simply 
connected region T* by making boundary cuts. In this way a many-valued function 
is obtained which jumps as one crosses the cuts, and a branch of an n-valued function 
is extended to a function on a larger domain. 

Riemann also spelled out that he considered that his claim about the existence 
and uniqueness of functions, given suitable information about their values on the 
boundary of their domain, applied to arbitrary simply connected domains and not 
merely regions of the plane. 

In Riemann’s opinion, his approach greatly simplified such tasks as deciding when 
two expressions represent the same function. Riemann did consider whether the class 
of complex functions as he defined them agreed with the class of functions definable 
by a finite or infinite number of operations of addition, multiplication, subtraction 
and division, and he claimed that they did. Had that been true his programme might 
have had an easier time, but Weierstrass’s later refutation of that claim was to be one 
of his reasons for disputing the fundamental validity of Riemann’s approach (see 
below, § 20.4). 

But even on its own terms Riemann’s approach raises an urgent uniqueness ques- 
tion that his mapping theorem was designed to address. 


16.5 The Riemann Mapping Theorem 


As we have seen, Riemann’s approach to the study of the possible domains of complex 
functions was to divide a domain into simply connected pieces. His defence of 
the Dirichlet principle allowed him to assert the existence of complex functions on 
simply-connected domains, although must be said that few mathematicians at the 
time, and even for half a century afterwards, were persuaded by this existence claim, 
and under the influence of Weierstrass and people around him, it came to be believed 
that the argument rested on an appeal to Dirichlet’s principle and had therefore to be 
rejected. 

But from Riemann’s perspective, it was natural, and perhaps essential, for him 
to see if, from the point of view of his theory of complex functions, there was 
essentially only one simply connected domain or if there were many. This is the 
topic he addressed in the second part of his paper of 1851, where Riemann offered 
an argument to show that? 


5See Riemann (1851, § 21). 
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Two given simply connected plane surfaces can always be mapped onto one another in sucha 
way that each point of the one corresponds to a unique point of the other in a continuous way 
and the correspondence is conformal; moreover, the correspondence between an arbitrary 
interior point and an arbitrary boundary point of the one and the other may be given arbitrarily, 
but when this is done the correspondence is determined completely. 


This important result, in a somewhat more precise form, is nowadays called the 
Riemann mapping theorem. It establishes not only that there are complex functions 
defined on any simply connected region, and indicates how they can be defined, 
but also that any two such regions are equivalent for the purposes of Riemannian 
complex function theory. 

The claim amounts to showing—as an example of the application of his general 
results, as he put it (1851, §21)—that any two simply connected planar regions can 
be mapped conformally onto one another. To do this, Riemann began by noting that it 
was enough to show that any such region T can be mapped conformally onto the unit 
disc. To obtain the conformal map of T to the unit disc, and to show that it extended 
to the boundary, Riemann took local coordinates z — zo defined on a suitably small 
disc around an arbitrary interior point of 7, cut this disc along a radius, and defined 
the map f(z) = log (z — zo) = log (r) + wi on that disc. He then extended the cut to 
a specified point on the boundary of 7, and the function f to a continuous function 
a+ (i defined on the whole of T that agreed with f on the boundary of the disc and 
had these properties: 


1. a vanished on the boundary of T, 
2. a+ (i jumped across the cut by 277 (like log). 


He then argued that, by his general principle, there is a complex function u + iv 
with suitable jumps. The function wu will take every value from —oo (at zo ) to 0 (on 
the boundary of T). Then, because T was simply connected and the function u was 
harmonic, the level sets u~!(a) are single simple closed curves for every value of a. 
Therefore, he concluded, the sought-for function was 


et» T + D 


After a few remarks indicating that the entire theory could readily be extended to 
regions which are not simply connected, this remarkable paper came to an end. 


16.6 Reactions to the Paper of 1851 


Riemann’s thesis was not published well enough for it to have a significant impact. As 
was the custom at the time it printed as a separate thesis and distributed to a number of 
German universities, but it was not reprinted in a journal such as Crelle’s Journal fiir 
die reine und angewandte Mathematik. This is the main reason his approach did not 
spread in the early 1850s. It is also clear that anyone who read the paper would have 
encountered many troubling novelties that show up in its vagueness at certain crucial 
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points. There is the striking and emphasis on arbitrary (and above all non-planar) 
domains, and topological concepts such as cuts and connectivity. A complex function 
is both a single object with characteristic properties (complex differentiability, the 
Cauchy—Riemann equations, conformality) and the sum of its real and imaginary 
parts which, being harmonic, are reduced to their boundary behaviour and their 
singular points. Dirichlet’s principle, whether assumed or ‘proved’, lurks at the heart 
of the paper. It is entirely right that Lars Ahlfors, a leading complex analyst of the 
20th century, was to say that Riemann wrote “almost cryptic messages to the future” 
and that his mapping theorem is in a form that “would defy any attempt at proof, 
even with modern methods”. 

The contrast with the approach of the previous generation was stark. In the hands 
of a master such as Jacobi, complex functions were given, indeed defined, by certain 
formal expressions: infinite series, infinite products, implicit functions, and integrals. 
His theory of elliptic functions, although daunting, had a directness about it: expres- 
sions for functions yielded information about the functions. Riemann’s presentation 
was meant to change the type of questions one could ask about a complex function, 
away from the computational and towards the conceptual. 


16.7 Riemann Surfaces and Conformal Maps 


Riemann Surfaces 


Riemann surfaces were greeted as a very difficult idea, as we shall discuss in 
Chap. 18. Here a picture by Gustav Holzmiiller, an otherwise minor mathemati- 
cian who excelled at providing pictures to illustrate theorems, has a go at explaining 
what two branch points of order three might look like (see Holzmiiller 1882, Fig. 49) 
(Fig. 16.1). 


Fig. 16.1 Two branch points a 
of order 3 


welche der Fusktion ee eh awh 


‘Ricmanseche Fiuche, 
Zao + age + 052 + ty extepricht, 


Wr. 1458 HOLZMULLER Uber die konforme Abbaaung. 1650 
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Fig. 16.2, A Holzmiiller —s 
picture of 5 (z + 1) tobene 


Fig. 16.3. A Maple picture a 2 27 
of 3 (z+ 4) 


~21 


Conformal Maps 

Holzmiiller also enjoyed depicting the effect of various maps on the complex plane. 
This one (Holzmiiller 1882, Fig.40) is an attempt to describe the map (later used in 
the Zhukovskii aerofoil) Z = 5 (2 + 1). The map is 2—1 except at +1. The points 
ti map to 0. The comparison with the Maple map is instructive (Figs. 16.2 and 16.3). 


Chapter 17 
Riemann’s Later Complex Function Theory 


17.1 Introduction 


Riemann continued to develop his own ideas, extending them to multiply-connected 
domains defined by algebraic curves. He presented them in public when he lectured 
on complex functions, in particular elliptic and Abelian functions, in 1855/56 and 
again in 1861/62, and he published them in his remarkable paper on abelian functions 
in 1857. 


17.2 Riemann’s Lectures from 1855/56 to 1861/62 


A number of sources have survived that illuminate how Riemann lectured on com- 
plex function theory during his years as a Professor in Gottingen, and they make 
for an interesting comparison with his published papers. Some extracts were pub- 
lished in the first edition of the Werke, and in the Nachtrédge that accompanied the 
second edition. In 1896 and 1899 Hermann Stahl published his own notes on the 
Riemann’s lectures on the theory of Abelian functions and the theory of elliptic 
functions respectively. 

Riemann’s lecture course of 1861/62 was also written up by Roch and published 
in 1863 and 1865, making them one of the few sources for his ideas that were 
available in his lifetime. In it, complex numbers are first explained geometrically, 
as corresponding to points in the plane, and then algebraically as the end result of 
a process of inverting the operations of arithmetic (for example, inverting addition 
to obtain subtraction). One begins with the natural numbers and proceeds to the 
integers, the rational numbers, and finally the complex numbers. The interpretation 


‘Riemann also applied these ideas to the theory of the hypergeometric equation; I plan to discuss 
this in a later course of lectures. See also Gray (2000a). 


© Springer International Publishing Switzerland 2015 175 
J. Gray, The Real and the Complex: A History of Analysis in the 19th Century, 
Springer Undergraduate Mathematics Series, DOI 10.1007/978-3-3 19-23715-2_17 


176 17 Riemann’s Later Complex Function Theory 


that goes back to Wallis” of ,/—1 as a mean proportional between +1 and —1 was 
given, and Gauss’s views of 1831 were supported, and then it was conceded that 
the geometric was more important for making complex quantities intuitive than for 
establishing their reality. 

The lectures then presented the definition of a complex function w as a complex- 
valued euneHon that is complex differentiable, and therefore satisfies the equation 
Ge = =i? 5, This ensured that its derivative depended only on the position of z but 
not on the direction of dz. Roch (but not Riemann in 1861) then observed that a 
complex function is conformal except where the derivative vanishes; Riemann also 
pointed out that the real and imaginary parts of the function are harmonic, although 
he did not use the word. 

The integral of a complex function along a path was then defined, which led to the 
question of when the integral around a closed path vanished. Riemann, as he had in 
his (1851), took a Green’s theorem approach, proving that under suitable conditions 
on functions X and Y 


// (5 - Fi) axa = [oars vay 
Oy 


where the first integral is taken over a plane region and the second integral over its 
boundary. When X = w and Y = iw, uy — ox = = 0 and so Riemann concluded 
that the integral of a complex function around a closed contour vanished when the 
function w was everywhere finite and continuous inside the contour. 


What happens when w becomes infinite at a point in the region? The example 
i ge log z shows that in such cases the integral around a closed contour need 
i 2 


not vanish. Riemann showed that if a function f(z) defined in a region is such 
that there is a point a; in the region where lim,.,, f(z) is infinite as z > aj 
but lim;-.a, (z — ai) f(z) is finite and equal to cj, say, and the function is finite 
everywhere else in the region, then the integral of taken clockwise around, is 27c;. 
From this, Riemann deduced that if a function f is analytic everywhere in the interior, 
and f is an arbitrary interior point, then 


ol f() 
f= xo | Paz 


and the nth derivatives satisfy: 


iO 


These are the Cauchy integral formulae, which Riemann could have read— 
along with the strange term “finite and continuous’—in Cauchy’s Exercises de 


2See Wallis, Algebra, 1673, Vol. 2, Ch. 66, and the English translation in Smith, Sourcebook, 46-54. 
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mathématiques, which he borrowed from the Gottingen University Library when 
he was a student in 1847. 

These formulae allowed Riemann, as they had Cauchy, to show how a complex 
function can be developed as a convergent power series on a suitable disc, and as 
a convergent Laurent series inside an annulus. Furthermore, if a function vanishes 
along a curve all the coefficients in its power series expansion vanish, so the only 
complex function that vanishes along a curve is the zero function, and any identity 
between functions that holds for real values of their arguments holds also for complex 
values. 

Now that Riemann had shown that the complex functions are the ones that, at least 
locally, have power series expansions, he could investigate properties of complex 
functions by looking at their power series representations, and in this way he began 
the investigation of the extent to which a function is known up to a constant when 
functions are given which have the same infinities. 

Riemann said that a function f is infinite of order m at z = a if it is the case 
that f(z)(z — a)! = oo when z = a but f(z)(z — a)” does not (later, people 
said that such a function has a pole of order m; the term was introduced in Neumann 
(1865, 38)). Riemann also similarly defined what it is for a function to be infinitely 
small of order m (i.e. have a zero of order m). This allowed him to show that if a 
function has only finitely many points, including z = oo, where it is infinite, then 
it is a rational function, and if it is never infinite even at z = oo then it must be a 
constant. Riemann gave two proofs of this result, which he did not call Liouville’s 
theorem: one by looking at the power series expansion, and another using the fact 
that under the stated conditions the integral 


. f @) 
poe f a 


taken on a circle centre the origin approaches a constant value as the radius tends to 
infinity. This is because it can be regarded as the integral around z = oo and that is 
finite by earlier results. 

Riemann finished this part of his course by giving a proof of the fundamental 
theorem of algebra. He first looked at the integral of the logarithmic derivative of a 
function f and showed that 


1 f'@ 1 [Rta (17.1) 


dz= 
Imi | FQ” Ii dz 


evaluated on the boundary of a region, is equal to the number of zeros minus the 
number of poles inside the region, counted according to multiplicity. 

Now, a polynomial of order 7 is infinite to order n at z = ov, but (17.1) vanishes 
for such functions, so the single nth order pole of the polynomial at infinity, which 
is its only pole, must be balanced by a total of n zeros. 
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Riemann also gave another proof, which, as he remarked, is essentially Gauss’s 
third proof of the fundamental theorem, and proceeds by evaluating (17.1) for the 
function £ a , and interpreting the answer, n. 

Riemann finished the course by showing how some real definite integrals can be 
evaluated by contour integration, and then giving a brief description of the branch 


points of a many-valued function. 


17.3 Riemann’s Theory of Elliptic Functions 


As one might expect, Riemann’s geometrical theory of functions was well suited to 
deal with elliptic functions, so much so that one wonders at what stage Riemann had 
that aim in mind. Unhappily, there is no evidence to illuminate that question, so we 
must set it aside. 

His lecture courses on elliptic and algebraic functions were usually offered after 
the courses on complex function theory, and the lectures for 1861 are interesting 
because in them he showed for the first time how the geometry of the complex curve 
(which is a real two-dimensional surface) defined by the function y? = f(x), where 
f (x) is a cubic or quartic in x, illuminates the whole study of elliptic integrals and 
elliptic functions.* Riemann’s grasp of the surface as a branched covering of the 
sphere enabled him to do what both Jacobi and Cauchy had been unable to do and to 
keep track of the values of the integrand (which, being a square root, is a two-valued 
expression). This was the first time that Cauchy’s approach to complex function 
theory was made to apply to many-valued integrands apart from Puiseux’s (1850) in 
which had investigated integrals of algebraic functions. 

In these lectures Riemann confined himself to the traditional cases when the roots 
of the quartic are either real or come in complex conjugate pairs, but there is no 
reason to suppose his analysis was restricted to these cases—the most likely reason 
is pedagogical. 

Riemann took the surface T corresponding to the curve, which is a double cover 
of the sphere branched over the zeros of f(z) and at z = oo when f(z) is a cubic, 
and cut it into a simply connected region T’ by two cuts a and b. Then he showed 
that the elliptic integral in its normal form maps T’ onto a copy of a rectangle in 
the complex plane. The sides of the rectangle are determined by the periods of the 
integral, and the rectangles tile the plane. The value of the integral depends on the 
path and the end point of the integral determines the value of the integral only up to 
a sum of periods. This illustrates the many-valued nature of the integral. The net of 
rectangles to the plane displays the doubly periodic nature of the inverse map. The 
periods arise as integrals along the cuts, and Riemann had now shown how they arise 
by integrating along non-contractible closed curves on a torus. 


3For the published course see Stahl (1899); the chapter on the presentation of the theory of theta 
functions was taken from the course of 1856. 
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Riemann’s account of Jacobi’s theory of theta functions also displays a novel use 
of his theory of functions. He defined a theta function by a power series. To get the 
Jacobian elliptic functions as quotients of theta functions he wrote down a quotient 
that had the correct zeros and poles and then multiplied by an exponential factor to 
get the periods right, and used a little trick to get the constant multiple right. The 
result that the elliptic function equals this quotient then followed from Liouville’s 
theorem. 

In the course of 1856 he had given an alternative account: he defined the period 
parallelogram, P = {0, 1,7, 1+ 7} for the theta function, pointing out that it was 
not, strictly, a periodic function. Integration around the boundary showed that it had 
exactly one zero inside P, which could be found explicitly from the power series to 
be at 7/2. Thence, following Jacobi, the other three theta functions, and a suitable 


2 
| ) was then doubly periodic with a double pole at 7/2. 


From which it followed by counting poles that ae 


and so ./x = sn(2Kv). 

The lecture course of 1856 is therefore more traditional in its approach, but it 
is also interesting because it was a try out for some of the material that was later 
published in the great memoir of 1857 on Abelian functions, to which we shall turn 
after quickly looking at a Riemannian treatment of elliptic integrals and functions 
that was given in 1864 and which helps us understand what Riemann was doing. 


quotient (x := a? ( 


was the square root of a cubic, 


Durége on Elliptic Integrals 


In his book of 1864, Heinrich Durége took 14 pages (pp. 187—201) to connect elliptic 
functions back to the corresponding elliptic integrals. It is interesting to follow his 
account, because it illustrates what had to be done to make Riemann’s geometric 
ideas comprehensible to an audience for whom they were unfamiliar. 

Durége considered the elliptic integral 


v=[ dz 
Jo JG =2)0 = 222) | 


As he observed, there are two values of w for each value of z except where z = 
+1,-1,+ i - i He expressed this awkwardly and arguably misleadingly by saying 
the z-surface always has two leaves and has four points of discontinuity, which are 
called its branch points. These points are not to be excluded, because at them the 
integrand is only infinite with order 7 Nor is the point co to be excluded, because 
there zf(z) = lim z-+00 


To render the surface simply connected, Durége had to choose cuts in the surface 
that go between the branch points; he chose to join +1 and —1 and to join +4 and 
—;. This led him to Fig. 17.1. 

The transition from a continuous to a dotted line is made because a loop such as 
q2 crosses from one sheet to the other, whereas a loop such as q; stays entirely in one 
sheet of the surface. He then explained the many-valued nature of the elliptic integral 
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Fig. 17.1 Durége’s figure of 
a simple elliptic function 


by tracking the values of the integral along the loops gq; and q2, and he depicted the 
infinitely many values of the integral by means of a parallelogram lattice. 

Then, he said (on p. 193) “We also want to follow Riemann and make the con- 
nection between the doubly periodic function and the elliptic integral in the opposite 
direction, namely starting from the doubly periodic function.” To this end he sup- 
posed he had a doubly periodic continuous function ¢, so 


o(w + Ai) = d(w + Az) = ow), 


for some complex numbers A; and A». He argued that the ratio of these numbers 
cannot be real, or the function would either be simply periodic or constant. By an 
earlier result—Liouville’s theorem although Durége gave it no special name—the 
function must have a pole somewhere. On integrating the function around a period 
parallelogram and applying the Cauchy residue theorem the sum of the residues must 
vanish, so such a function cannot have just one simple pole and must have at least 
two. If it has two they have equal and opposite residues, and in this, the simplest 
case, the function can be written as 
c c 
o(w) = —— —- —— + yw), 
w-r w-s 

where the poles are at w = r and w = s and ~ is a function that is not infinite at 
any of the points r + m,A,+m2A2 or s +m ,A, +m2Az2. Durége then showed that 
o(w) = d(r + 5s — w). Moreover, 


Cc 
(w —r)? 


d 
¢'w) = = + + uw). 


(6; 
(w —s)? 


This function has two double poles and so, as a function of w, it takes every value 
four times in each period parallelogram. The derivative is a single-valued function of 
: Zz : : 
w but, Durége then showed, a two-valued function of z, so (=) is a single-valued 
w 


Aen? 
function of z. Whence — is of the form C(z — a)(z — 2)(z — 7)(z — 6) and 
w 


one has returned to the original elliptic integrals. 
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17.4 Riemann’s Paper of 1857 and Complex Functions 


Riemann’s paper of 1857 on Abelian functions is undoubtedly one of the most impor- 
tant papers on mathematics published in the 19th century. Unlike the (1851), Rie- 
mann’s (1857c) was published in the principal mathematical journal of the day, the 
Journal fiir die reine und angewandte Mathematik, which explains why Riemann 
repeated many of his earlier ideas. Its importance was appreciated at once by Weier- 
strass who, as we shall see in Chap. 19 had already been working on the topic for 
some time, and who, on reading Riemann’s account withdrew a paper he was writing 
on the subject from publication. He was not respond in print until 1869. 

The topic is the integrals of arbitrary algebraic functions. As we saw in Chap. 3, 
the subject of elliptic functions grew out of the study of integrals of the form 


dx 
z= | —, 
V8 (x) 


where the function y = g(x) is a cubic or quartic in x. Geometrically, this can be 
considered as studying integrals of the form 


ve = [ fo. »ax 


on the complex curve y” = g(x), although investigations mostly avoided this inter- 
pretation in favour of strictly analytic methods. By the early 1850s attempts had 
been made to go beyond the quartic case and to contemplate integrals of the form 
z= | f(x, y)dx where there is an algebraic relation between x and y of the form 
g(x,y) =0. 

In two papers of 1854 and 1856 Weierstrass made his name by resolving a special 
case of integrals of the form [ ids on acurve with equation y* = g(x), where g(x) 
is of degree n > 4. This, the so-called hyper-elliptic case, had been studied by some 
earlier writers, including Hermite and some students of Jacobi. But in his (1857) 
Riemann dealt with integrals of the form [ f(x, y)dx on any plane algebraic curve 
g(x, y) = 0 whatever. 

To do so, Riemann pushed the ideas in his (1851) from being a good indication 
that complex function theory can be done on any ‘surface’ to showing in detail how 
it could be done on complex algebraic curves described by equations g(x, y) = 0, 
although he only gave details for curves having at most some double points in the 
complex plane and no more complicated singularities. 

Riemann called the space of points (x, y) such that g(x, y) = 0a surface T. He 
described how to dissect T by either closed curves or curves with end points on the 
boundary of the (partially dissected) surface until a simply connected surface T’ is 
obtained. He explained how to think of integrals on 7, and discussed when an integral 
may be independent of the choice path joining its end points. To introduce the idea of 
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Fig. 17.2. A loop from P to P along a curve o that crosses a cut 


complex functions defined on T he summarised his earlier use of Dirichlet’s principle 
and concluded that there is a unique function on T’ with prescribed singularities in T’ 
and whose real part may be defined arbitrarily along the cuts that form the boundary 
of T’. The whole aim of the paper was to show how to study the integrals of complex 
functions on 7, and the use of geometry was wholly new. 

Riemann supposed that g(x, y) = 0 was irreducible, and of degree n in x and m 
in y, and regarded it as representing as an n-sheeted covering T of the y-plane; there 
are, in general, n values of x for each value of y. Points at which there are fewer than 
n values of x are called branch points. 

Riemann remarked that rational functions of x and y (which, by definition, are 
the quotient of one polynomial in x and y by another), are single-valued functions 
on T that are branched like x, and he proved the converse later in the paper. The 
integral of a rational function yielded a many-valued function, because the integral 
may be conducted along inequivalent paths that wind around the branch points. 
However, these different analytic continuations differ only by constants, because 
their derivatives at the same point necessarily agreed. It was this system of functions 
that Riemann studied. 

Suppose that P is point inside T’ and that Q; and Q> are corresponding points 
on two parts 7 and 2 of the boundary of T’ (so 7; and 7 are the two sides of a cut 
y on T and Q, and Q> correspond to the same point on JT, as shown in Figs. 17.2 
and 17.3). Consider an integral of a rational function f(x, y)dx on T along a simple 
loop o starting at P and passing through Q; = Q>. This path appears on 7’ as a path 
from P to Q; anda path from Q> to P. We do not expect { f(x, y)dx taken along 
o from P to P to vanish, rather, it should be a non-zero period of the integral. This 
means that the values of the integrals 7 ' f(x, y)dx and [>° f(x, y)dx (where the 
second integral is taken along a in the reverse direction) will differ, by an amount 
that Riemann called the jump of the integral at Q; = Qo. 

Now consider a similar figure in which the loop o through P deviates along the 
path o2 and crosses ¥ at another point R, say, as shown in Fig. 17.4 and suppose that 
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Fig. 17.3. The loop o as it appears in T’ 


Fig. 17.4 The loops a2 and a2 as they cross the cut y 


no branch points of T are contained in the region bounded by oj and o2. Then, by 
what Riemann regarded as a consequence of Green’s theorem’ the integral taken in 
the clockwise direction around o, and the anticlockwise direction around a7 is zero, 
so the contributions of the integrals along a; and 2 to the integral along the loop 
that starts at P are the same, and therefore the jump of the integral at Q; = Qo is 
the same as jump of the integral at Rj = R2 (the points on the boundary of 7’ that 
correspond to R). 

So the conclusion is that the jump of the integral of a rational function across a 
cut is constant along a cut. It will, of course, vary from cut to cut. 

Now, a complex function that is bounded on all of T’ and does not jump across 
any cut will correspond to a complex function that is bounded on all of T, and 
by Liouville’s theorem (not that Riemann gave it a name) such a function must be 
constant. But an integral with a non-zero jump on at least one cut corresponds to 
a many-valued function on 7, the different values being determined by the path of 
integration. So Riemann now had a way of generating non-trivial functions on T, at 
the price of allowing them to be many-valued. 


4 And we might regard as the Cauchy integral theorem. 
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To produce single-valued functions, Riemann used the fact that integrating a 
function around a point where the function has a simple pole produces a many-valued 
function. He argued that it should be possible to make the contributions balance, so 
that the integral of a function with simple poles has jumps of zero across every cut. 
There will be no contradiction with Liouville’s theorem, because the integral is now 
infinite at the points where the function has simple poles. 

The surface T will be made simply connected by a system of, say, 2p cuts. These 
render T into a disc-shaped surface with a boundary of 4p: sides that are to be 
identified in pairs, and therefore 2 p jumps. It remained for Riemann to show how the 
two ways of introducing multi-valuedness could be made to cancel each other out, 
and his conclusion was that there was a system of single-valued complex functions 
on T having simple poles at m points of dimension at least m — p + 1.5 When 
m— p+1=1, orm = p, the corresponding functions are all constant, and there 
are non-constant functions as soon as m > p. 

This happily agrees with what is already known. When T is a sphere, for which 
p = 0, there are non-constant functions having only one pole; they are the functions 
of the form f(z) = acth with a simple pole at z = —d/c. When p = 1, which is 
the case for elliptic functions, there are non-constant elliptic functions with precisely 
two poles. 

In the second part of this remarkable paper Riemann showed how to generalise 
the theory of theta functions from one to several variables, which involved him in 
discussing functions in not one but p > | variables, This topic remains graduate 
work to this day and is out of reach here. 


>This is the so-called Riemann inequality. It was refined to an equality in Roch (1865). 


Chapter 18 
Responses to Riemann’s Work 


18.1 Introduction 


The impression is given in several places that few people responded quickly to 
Riemann’s work, and those that did so with a view to finding fault with it, even 
arguing that it should be dismissed. It is true that some mathematicians began their 
papers by lamenting that nothing seems to have been done, but in fact his papers, with 
a few notable exceptions, drew a considerable amount of attention, as this chapter 
discusses. 


18.2 Critical Acceptance 


Elementary complex function theory on Riemannian principles was presented in 
Durége’s book of 1865, the first book on the subject at all in German, and it was 
echoed in a long article by Oscar Schlémilch (1866). In 1868 Casorati published his 
explicitly Riemannian presentation of complex function theory for an Italian audi- 
ence. More advanced topics were also treated on Riemannian lines: Clebsch (1864) 
discusses the concept of genus and explains Abel’s theorem; Neumann (1865) is 
about hyperelliptic integrals; Clebsch and Gordan wrote their (1866) on Abelian 
function theory; in the late 1860s Lazarus Fuchs (1865, 1866) generalised the hyper- 
geometric equation to a class of complex ordinary differential equations all of whose 
solutions are meromorphic everywhere (now called equations of the Fuchsian type). 
Riemann’s work was corrected and extended by several authors: by Prym, Christof- 
fel and Schwarz on the Riemann mapping theorem in the 1860s, and by Schottky in 
his (1877), who extended the Riemann mapping theorem to non-simply connected 
domains. Yet other authors took up the topological aspects of Riemann’s function 
theory, his introduction of trigonometric series, and his differential geometry, and in 
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the end only three of Riemann’s papers were left unattended: the one on the prime 
number theorem, the one on shock waves, and the one on Bessel’s equation. 

Moreover, Weierstrass had proposed Riemann in strong terms to the Berlin Acad- 
emy of Sciences in 1859. When they met they apparently got on well together, 
although it is true that in later years Weierstrass, egged on by his acolyte Schwarz, 
was more and more opposed to Riemann’s way of thinking. The importance attached 
to producing the first edition of Riemann’s Werke, notwithstanding its many delays, 
and editing his unpublished papers, lecture notes, and fragments, also suggests a real 
willingness to keep his ideas alive after his premature death. 

People who supported Riemann’s approach to complex function theory empha- 
sised the importance of considering the global nature of the domain of a complex 
variable, and at least in simple cases gave a topological analysis of the domain in 
terms of cuts in the Riemann surface. They allowed the appeal to Dirichlet’s princi- 
ple to establish the existence of complex functions on various domains and having 
prescribed singularities, although they stopped short of attempting a proof of that 
principle. But by and large, and certainly in books, they did not follow him into the 
deeper reaches of Abelian function theory. 

Even these matters raised problems. For example, Clebsch’s seven distinguished 
obituarists (Von der Miihll et al. 1874) listed as one of the three hardest problems 
confronting anyone who approached Riemann’s work the problem of understanding 
the topology of an algebraic curve and, conversely, of passing from an equation to 
the form of the corresponding Riemann surface. In the 1860s Clebsch and Liiroth 
described how a Riemann surface may be cut up along loops that encircle the branch 
points in particularly simple ways, and further contributions were made in Clifford 
(1877). 

As for the contentious Dirichlet principle, when Klein set about implying that he 
was Riemann’s natural successor he suggested that Riemann had advocated Dirich- 
let’s principle on physical grounds, but as Bottazzini has shown in his (1977) former 
students of Riemann’s, notably Prym, disagreed. In fact mathematicians all agreed 
with Riemann that the principle needed a proof—only physicists such as Helmholtz 
and Maxwell disagreed. The problem was, as we shall now see, that a proof seemed 
to be possible only under restrictions that Riemann’s gestures had not indicated. 


18.3 Prym Refutes the Dirichlet Principle 


Ironically, Riemann’s former student Friedrich Prym was to be the first person to 
refute Dirichlet principle. His example (Prym 1871) is interesting. He began by 
observing that Dirichlet’s principle implies that an arbitrary continuous function, 
u, defined on the boundary of a disc extends to a finite and continuous harmonic 
function defined on the whole of the disc. As he said, the weak point, in all known 
proofs of this result could be found by exploiting other of Riemann’s ideas, for they 
relied on the claim that the function u was identical with its Fourier series, and it 
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was a misapprehension to think that Riemann had proved that an arbitrary function 
is representable by its Fourier series.! In fact the best result was still due to Dirichlet 
and applied to functions having only finitely many maxima and minima. 

Prym then dug deeper, and showed that even when Dirichlet’s problem can be 
solved, Riemann’s use of the Dirichlet principle could be invalid. He considered a 
branch of the complex function 


u+iv=i/—In(R+x+iy) 


defined on adisc ofradius R < is and introduced polar coordinates p and t centred on 
the point (— R, 0). In the disc p took every value from 0 to2R < | andt every value 
from —z to z. The branch of logarithm taken was to satisfy —In(R+x +iy) = 
—Inp—it. 

The explicit form for u and v in terms of polar coordinates shows that the functions 
u and v are everywhere defined and single-valued, even on the boundary of the disc, 
and since the function u is the real part of a complex function it is certainly harmonic. 
So the Dirichlet problem is solved in this case. However, as Prym then showed, 
Dirichlet’s integral L (u, 0) is infinite. The reason, as his formulae make clear, is that 
the function u oscillates infinitely often in any neighbourhood of the point p = 0. 
Consequently there is no hope that step 2 of Riemann’s argument (L (1) < oo ) could 
be made to work. So Dirichlet’s principle can fail, even when Dirichlet’s problem 
can be solved. 

Although Weierstrass’s many students and more and more of his contemporaries 
preferred to use Weierstrass’s example of an integral bounded below and that does not 
attain its bounds as the way to show that Dirichlet’s principle is unreliable, the first 
direct refutation of it came from within the group of Riemann’s students. Whether it 
was decisive is hard to say, but widespread use of the principle in Riemann’s work 
meant that once it was called into question much else of what Riemann had done 
was also put in doubt. 

Weierstrass’s example was set out in one of the few papers that Weierstrass 
published—he generally preferred to confine himself to lectures in Berlin—and it is 
likely that each of these papers was intended to shake mathematicians out of their 
complacency. In his (1872) he raised the subject of Dirichlet’s principle,explained 
that if the Dirichlet integral exists and attains its minimum then the minimising func- 
tion is harmonic and unique, and then he turned to the existence question. Here he 
offered what he called a simple example to show the inadmissability of Dirichlet’s 
reasoning. He observed that 


'Prym cited Hankel, another of Riemann’s students, as having made this mistake. 
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where g(—1) = a 4 b = (1) is always positive and can take any non-zero value 
however small, but cannot take the value zero unless a vanishes on the interval 
[—1, 1], which is ruled out by the boundary conditions. 

An impression of how things stood by 1900 is given by the remarks David Hilbert 
made in what proved to be—as it was intended to be—a speech that set the agenda 
for the opening decades of the 20th century. 

At the end of the 19th century, two mathematicians stood as intellectual leaders: 
Henri Poincaré in France and the younger figure of David Hilbert in Germany. Hilbert 
was invited to be one of the keynote speakers at the second International Congress 
of Mathematicians, to be held in Paris in August 1900. His close friend Minkowski 


urged him to seize the day, writing to him: 


Most alluring would be the attempt to look into the future, in other words, a characterisation 
of the problems to which the mathematicians should turn in the future. With this, you might 
conceivably have people talking about your speech even decades from now. Of course, 
prophecy is indeed a difficult thing. 


After delaying so long that the invitation was nearly withdrawn, Hilbert wrote was 
has become probably the most cited single paper in mathematics, his address on 
the Problems of Mathematics. In it he presented two fruitfully contrasting views on 
mathematics: on the one hand, mathematics for its own sake; on the other mathematics 
in the service of science. He argued that high standards of rigour and understanding 
brought these strands harmoniously together, and then offered some 23 problems, 
several in several parts, as good problems to work on. Such was Hilbert’s prestige 
as the years went by that mathematicians have felt especially honoured when they 
solved one of the problems.’ Few remain unsolved today; one of the few to do so is 
the Riemann hypothesis. 

Among the problems Hilbert singled out was the Dirichlet problem and the Dirich- 
let principle. 


20. THE GENERAL PROBLEM OF BOUNDARY VALUES. 

An important problem closely connected with the foregoing is the question concerning the 
existence of solutions of partial differential equations when the values on the boundary 
of the region are prescribed. This problem is solved in the main by the keen methods of 
H. A. Schwarz, C. Neumann, and Poincaré for the differential equation of the potential. These 
methods, however, seem to be generally not capable of direct extension to the case where 
along the boundary there are prescribed either the differential coefficients or any relations 
between these and the values of the function. Nor can they be extended immediately to 
the case where the inquiry is not for potential surfaces but, say, for surfaces of least area, 
or surfaces of constant positive Gaussian curvature, which are to pass through a prescribed 
twisted curve or to stretch over a given ring surface. It is my conviction that it will be possible 
to prove these existence theorems by means of a general principle whose nature is indicated 
by Dirichlet’s principle. This general principle will then perhaps enable us to approach the 
question : Has not every regular variation problem a solution, provided certain assumptions 
regarding the given boundary conditions are satisfied (say that the functions concerned in 


2See Riidenberg and Zassenhaus (1973, 119). 
3See Gray (2000b) and Yandell (2002). 
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these boundary conditions are continuous and have in sections one or more derivatives), and 
provided also if need be that the notion of a solution shall be suitably extended?* 


Hilbert did not entirely establish a solution in his own later work, but his bold idea 
was to bridge the gap between mathematicians and physicists by delineating a class of 
boundary conditions which were general enough to cover the physical situations and 
yet narrow enough to finesse the mathematical difficulties. And if pressed, Hilbert 
admitted, one might weaken what was meant by a solution a little. In short, Hilbert 
raised the issue of establishing, for a large class of equations including several that 
arise in various physical problem, theorems that guarantee the existence of solutions 
in a possibly novel sense to be made precise by future research. Much of the research 
agenda in partial differential equations to this day was thus established for the first 
time. 


18.4 Other Responses 


When Dirichlet died in 1859 Rudolf Clebsch was appointed as his successor in Gét- 
tingen, and he took the opportunity to see what could be done to promote Riemann’s 
ideas (by this time Riemann was already very ill and was spending as much time as 
he could in Italy to preserve his failing health). Clebsch had graduated from Jacobi’s 
school in K6nigsberg and initially worked on elasticity theory and hydrodynamics. 
His first major paper after his turn to Riemann is his (1864), where he endeavoured 
to find applications for abelian functions in the study of algebraic curves that could 
match those already well known and understood for elliptic functions. As he put it 
(1864, 189): 


The such applications have not yet been sought, although we have had Riemann’s theory of 
these functions for six years, can without doubt be put down in large part to the difficulties 
in the way of understanding the work under consideration, and which the recent efforts of 
younger mathematicians have not quite alleviated. 


In the 1860s Clebsch emerged as the leader of a strong group of algebraic geome- 
ters who saw themselves as leading a wave of new mathematics in just those areas 
neglected by Gauss. In this spirit they tried to rewrite Riemann’s theory of algebraic 
curves in the language of plane projective geometry, and to find a largely algebraic 
path from there to analysis. This appalled Riemann’s surviving pupils. Clebsch and 
his younger colleague Paul Gordan set out their approach in detail in their book 
(1866), and Prym wrote to Casorati in 1866 that “The attempt to base function the- 
ory on algebra is completely useless” and that they would never have dared to publish 
the foreword in Riemann’s lifetime. 

What enraged Prym was the preference for algebra over topology. However imper- 
fectly, Riemann had attempted to get away from defining a Riemann surface by 


4By a regular problem he means one such as the Dirichlet problem. Here he referred to his own 
lecture on Dirichlet’s principle: (Hilbert 1900, 1904). 
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an equation, and had attempted to prove that any finite Riemann surface could be 
described by (an equivalence class of) equations. This separated the surface, thought 
of abstractly, from any embedding it might have in some complex projective space; 
the Clebsch-Gordan treatment collapsed that distinction. The surface was regarded 
as a complex curve in the complex plane, and the genus was defined in terms of the 
degree of the equation defining the curve and the number of its double points and 
cusps (curves having worse singularities were for the time being excluded). Even 
though they could show that birational transformations of the surface did not change 
the value of the genus (at least for curves that were not too singular) their definition 
was evidently not intrinsic. 

The function-theoretic side of the book was more successful. Whereas even Rie- 
mann’s followers had to admit that his theory of theta functions was not intimately 
connected to the rest of the theory, the book spelled out at length the connection 
between theta functions and integrals of the third kind, which Riemann had only 
sketched. 

However, even Clebsch was not long enlisted in Riemann’s cause. He was to have 
edited Riemann’s Werke, but he died unexpectedly of diphtheria in 1872 when only 
39. The task passed to Riemann’s friend Dedekind, and a younger mathematician, 
Heinrich Weber, who was more sympathetic to Riemann’s ideas. 

The people Clebsch left to pursue his ideas continued to prefer an algebraic geom- 
etry based on a study of equations, and while they named theorems after Riemann, 
it was only after giving them entirely different proofs. They came to regard Rie- 
mann’s reliance on the ‘misleading’ Dirichlet’s principle as the problem, as Brill 
and Noether’s monumental historical survey of algebraic function theory, written in 
1894, makes clear. Definitions become indefinite, they wrote (1894, 265), and 


At such a level of generality the idea of a function, incomprehensible and evaporating, no 
longer delivers reliable conclusions. In order to delineate precisely the domain of valid- 
ity of the theorems under consideration, people have recently abandoned Riemann’s path 
completely. 


Riemann had also had some students of his own at Gottingen; we have already 
mentioned Roch, Hattendorff, Hankel, and Prym. There was also Enrico Betti, who 
was a very valuable Italian contact. But as we noted earlier Roch died in 1866 age 
26 and Hankel in 1872 age 34; Hattendorff died in 1882 age 48. Had Klein arrived 
in Gottingen in 1872 to study with a 45 year-old Riemann recovered from pleurisy, 
how different the growth of complex function theory would surely have been. 

The response from Berlin, by far the dominant university for mathematics at the 
time, was different. Weierstrass and Kronecker were the most influential analysts, and 
until their personal relations became clouded in acrimony in the 1880s they were often 
engaged together in trying to sort out fundamental points in both real and complex 
analysis. This opened the way for Schwarz, initially with Weierstrass’s support, to 
examine more and more of what Riemann had done with a view to making it not 
only more rigorous but also more orthodoxically Weierstrassian. This resulted in 
the Schwarz—Christoffel approach to the Riemann mapping theorem and Schwarz’s 
alternating method for solving a modified form of Dirichlet’s problem (Schwarz 
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1870a, b). Riemann’s insights were reworked at a price it will be instructive to 
examine when we have considered what these alternative, Weierstrassian principles 
were. 

In any case, Riemann’s topological way of thinking was not one Weierstrass 
wished to advance, and because he liked to present function theory entirely as he 
wished to see it and gave very few references to other literature the relatively large 
audiences he attracted would not necessarily know when they were entering Rie- 
mannian territory. Instead power series methods were given a central place; even the 
Cauchy integral formula was eliminated from his presentation of complex function 
theory. One unexpected result was that few of Weierstrass’s former students shared 
his preferences and he left less of a school behind him than he wished. 

Italian mathematicians proved the most sympathetic to what Riemann had been 
trying to say. Apart from Casorati, who never met Riemann, there was also Riemann’s 
friend Betti, who ran the Scuola Normale in Pisa for 30 years, The Scuola Normale 
was a small, high quality institution where most of the next generation of Italian 
mathematicians trained. Betti was faithful to the breadth of Riemann’s interests, and 
that influenced the work that came out of that school, for example in the papers 
they submitted in due course to Klein when he was establishing the Mathematische 
Annalen—the journal founded by Clebsch that he now edited—and trying to outflank 
Berlin’s Journal fiir die reine und angewandte Mathematik. 

On the other hand, Riemann’s influence was negligible in France for a generation, 
and this, together with the increasingly ambiguous attitude to it in Berlin, led to 
a justifiably profound sense of a rediscovery of Riemann’s ideas that attended the 
rehabilitation of his ideas in Géttingen in the 1880s. Klein began this process when he 
arrived in Gottingen in 1886, Hilbert took it up in due course, and so very markedly 
did Hermann Wey] in the next generation. 

Klein had become a professor in 1872 at the remarkably early age of 23, and 
swiftly attached himself to Clebsch, whom he later described as a divinely inspired 
teacher. He was highly talented, very ambitious, and saw an opportunity for himself 
in mastering what Riemann had done, not least because he shared some of Rie- 
mann’s traits, a rich intuition and a liking for geometry among them. But in 1881 
the young French mathematician Henri Poincaré began to publish a stream of dis- 
coveries involving non-Euclidean geometry and automorphic functions, topics close 
to Klein’s interests but treated from a different perspective (oddly enough, a more 
Riemannian one) and Klein’s efforts to match this work, as he was later to say, cost 
him his health. 

Part of Klein’s programme to recover was to lay ever greater claim to the Rie- 
mannian mantle. His energies diversified out of research, which he came to delegate 
to his best students, and into building up mathematics at Gottingen in direct compe- 
tition with Berlin. He sought money from the state of Prussia, brought in industrial 
money for work on applied mathematics, and set about creating a tradition of math- 
ematics at Gottingen that embraced Gauss, Riemann, and his own day.” He helped 
edit the many unpublished papers of Gauss; he wrote a richly interesting history of 


5 See Tobies (1981). 
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mathematics in the 19th century; and finally he broke with the successors of Clebsch. 
Clebsch is the only mathematician Klein criticised in his popular lectures in Evanston 
in 1893, and Brill and Noether repaid him by slighting his contributions in their own 
historical survey (1894). 


Textbooks 


If we count the book (Briot and Bouquet 1859) as a book on complex function and 
elliptic function theory, then the first book on complex function theory proper is the 
one by Durége, a former colleague of Riemann’s. As one would expect, it is along 
the lines of Riemann’s lectures. It ran to four editions, was translated into English 
for use in American universities, and was used by those in the Weierstrassian school 
when they found themselves outside Berlin. For a time Teubner editions carried an 
endorsement of it by Fuchs, for example. In due course there was a polarisation of 
German textbooks into either Riemannian or Weierstrassian styles, and then there 
was an attempt to combine the best of both. But curiously, Weierstrass himself never 
published his version (or, better, versions) and the result was to be a failure to spread 
his ideas abroad, where the Cauchy—Riemann style prospered. 

In 1865 the first edition of Carl Neumann’s Vorlesungen tiber Riemann’s Theorie 
der Abel’schen Integrale came out. Neumann confined his analysis to the hyperel- 
liptic case, and with that restriction aimed to make Riemann readable. In the second 
edition Neumann did the general case of Abelian functions, but now he retreated from 
his earlier endorsement of Dirichlet’s principle. The first edition was disparaged by 
Roch, who, although he found it praiseworthy, found it too long to be lectures, often 
verbose, and to differ too much from Riemann’s own presentation of these ideas. 
The second edition, when it was re-read in G6ttingen later, was said by Klein to 
have made every thing look so easy it was insulting, which shows how far things had 
swung Riemann’s way.° 

Mention should also be made of the book by Schl6milch (who, by the way, was the 
person who encouraged the young Roch to go to Gottingen and study with Riemann). 
His (1866) contains enough material on the subject to count as only the fourth book 
on complex function theory to be published, and it ran to several editions. Pages 35 
to 111 cover functions of a complex variable, and further chapters look at elliptic 
integrals and elliptic functions. Like Durége he used the theory of Riemann surfaces 
to deduce the properties of elliptic functions from those of elliptic integrals. 

Casorati’s remarkable book of 1868 is one of the best textbooks. It opens with 
a reliable and insightful historical account of almost 200 pages, and then devotes 
over 250 pages to describing the current state of the art. The ideas and techniques 
of Cauchy, Riemann, and Weierstrass, are all presented, and the book did a lot to 
establish Riemannian complex function theory in Italy. Oddly enough, the first part 
about Riemann surfaces and Abelian functions is more difficult than the second, 
which goes over ground that Riemann himself had covered in his lectures. 


®See Klein Entwickelung (1926-1927, 1, 273). 
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18.5 Complex Algebraic Geometry 


It has become entirely natural for mathematicians today to regard the equation 
Ff (x, y) = 0, where f is a polynomial in x and y, as defining a curve in the Cartesian 
plane with coordinates x and y, and to think of the curve as the set of points (x, y) 
in the plane for which f(x, y) = 0. If not from the time of Descartes then certainly 
since Euler it has seemed natural to treat these curves algebraically by analysing their 
equations. Thus the fundamental theorem of algebra says that a polynomial equation 
of degree n has as n solutions, and geometrically that says that a curve of degree 
n will meet a straight line in n points.’ A more complicated argument establishes 
Bezout’s theorem, that two curves of degrees k and m respectively will meet in km 
points. 

There is a price to pay. A line should meet a conic section in two points, according 
to the algebra, but it may not meet it at all (in such cases the algebra gives complex 
points of intersection). The line may be a tangent to the curve, in which case algebra 
suggests counting the point twice. And the line x = a meets the parabola y = x? in 
the point (a, a*)—but where is the second point of intersection? All of these ques- 
tions can be answered by moving to the setting of complex projective geometry. In 
19th century language, projective space contains points at infinity, and this allows 
mathematicians to give a rigorous account of Bezout’s theorem once complex coor- 
dinates are introduced. Which raises the question: who first endowed a curve with 
all its complex points? 

It became gradually accepted that two real curves of degrees k and m respectively 
will meet in km points, provided some of the ‘points’ are complex. But this was 
variously regarded as an artefact of the algebra, or as an indication that the curves 
possessed a symmetry of some kind (a map from the curve to itself that fixed the real 
points and switched each complex one with its conjugate, for example). The issue is 
when a plane curve given by a polynomial equation f(x, y) = 0 was taken to be a 
subset of C x C. 

One partial answer is provided in a paper of Cauchy’s on elliptic function the- 
ory that he published in 1846, in which he spoke of the “complex variable y [that] 
satisfies the finite equation y* = (1 - ca) (1 - hex) [and] varies with x by insen- 
sible degrees”. What this involves was soon addressed by Puiseux in his long paper 
(1850), in which, inspired by Cauchy’s work, he showed how to regard a function 
f(u, z) = 0 of degree m in u as an m-sheeted covering of the z-plane branched 
over a number of points.® These examples suggest that complex analysts were mov- 
ing towards a geometric appreciation of some functions, but they fall far short of 
Riemann’s vision. 

Riemann was clear on every occasion that he regarded an algebraic curve with 
equation f(s, z) = 0 of degree n in s and m in z, as an equation in complex vari- 
ables that should be thought of providing an n-fold covering of the complex z-plane 


7The degree of a polynomial f(x, y) is defined to be the highest degree of its monomials. 

8 Although we know from Laugwitz (1999, 92) that Riemann had read Cauchy’s report on Puiseux’s 
memoir by December 1851 it seems unlikely that Riemann had anything to learn from Puiseux by 
the time his was writing his doctoral thesis. 
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(or, better, z-sphere). He then explained how such a covering surface could be 
regarded (in many different ways) as an algebraic curve in C x C. From his standpoint 
a curve simply had complex points on it. 

Such an idea is implicit in the work of Abel and Jacobi on elliptic functions, 
but they never presented it geometrically. Indeed, Jacobi found the idea of a two- 
valued integrand in an elliptic integral sufficiently complicated to motivate his move 
to base the theory of elliptic functions on the theory of theta functions and so avoid 
the integrals altogether. Riemann’s vision of complex algebraic curves was another 
barrier to the reception of his ideas, as Cayley’s comments in a paper of 1878 indicate. 
He distinguished carefully between Riemann’s approach and the simpler idea that a 
complex curve is a set of points in C x C, and then of the second idea “I was under the 
impression that the theory was a known one; but I have not found it anywhere set out 
in detail”. It would seem that only somewhere between Clebsch’s turn to algebraic 
geometry in 1864 and Cayley’s remarks 14 years later did curves acquire their full 
complement of complex points. 


18.6 Exercises 


1. What is the value of the Riemann mapping theorem? 

2. Illustrate, with examples, the value in being able to define complex functions on 
a variety of surfaces other than the plane. How does Riemann’s framework help 
explain the double periodicity of elliptic functions? 

3. What aspects of Riemann’s work were adequately established by the time he 
died? What problems confronted mathematicians wishing to take up Riemann’s 
work after his death in 1866? 


Chapter 19 
Weierstrass 


Pecans tle “ 


Weierstrass (1815-1897) 


19.1 Introduction 


A powerful, algebraic, alternative to Riemann’s geometric complex function theory 
was developed by Weierstrass. In this chapter we look at his early work, his success- 
ful arrival in Berlin, and his first lecture courses. Weierstrass was to be a powerful 
influence on the development of mathematics in Germany, and very often his influ- 
ence was indirect, because he frequently confined his discoveries to his lectures in 
Berlin. 
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19.2 Weierstrass’s Early Years 


If the generic myth of the great mathematician is someone who comes from nowhere 
and dazzles immediately (as, for example, Galois did) then Karl Weierstrass is the 
antithesis, in that he published his first paper in 1854 when he was almost 40. Even 
so, it must be admitted then he then rose very rapidly to prominence, because in this 
paper he solved the Jacobi inversion problem for hyperelliptic integrals, a fact that 
startled his contemporaries, and he went on to dominate the mathematical world for 
the next 40 years. 

Much of the mystery surrounding his late appearance dissipates, only to be 
replaced by another, when we learn that Weierstrass had been conducting impor- 
tant research for several years by 1854 but had not bothered to publish it. A deeper 
mystery is that he had a lifelong aversion to seeing his work appear in print, and 
published very little, confining himself lectures. 

Weierstrass was born on October 31, 1815 in Ostenfelde near Miinster, and spent 
four years, from 1834 to 1838 at the university of Bonn supposedly studying courses 
in jurisprudence and administrative matters that would lead to a career in the public 
services but without getting a degree or even trying to pass any examinations. Karl’s 
brother Peter told Gésta Mittag—Leffler that Karl was active in the Jugendabenteuern, 
joined the Korps Saxonia and spent his time more in cellars and student fights than 
in studying.' On the other hand, other sources suggest that Weierstrass had begun 
to read Crelle’s Journal while still at school, and he himself claimed in 1840, in a 
report on his education, that he had studied mathematics intensively while at Bonn. 
Perhaps both accounts are true. At all events, Weierstrass does seem to have studied 
Laplace’s Mécanique céleste and Jacobi’s Fundamenta Nova on his own, and at some 
stage he must have realised that his future lay with mathematics. 

Weierstrass entered the Theological and Philosophical Academy in Miinster in 
1839 to obtain a teacher’s diploma. The decisive event for his future career was 
Christoph Gudermann’s lecture course on the theory of elliptic functions. Gudermann 
was a friend of Jacobi and he based his course on his friend’s Fundamenta Nova, 
developing the elliptic functions as power series. There are three fundamental elliptic 
functions, for which Gudermann introduced the convenient notation sn(x), cn(x), 
and dn(x). As with the sine and cosine functions in trigonometry, these functions are 
connected by differential equations that can be made to yield the corresponding power 
series by the method of undetermined coefficients, but the argument is considerably 
more complicated. 

Weierstrass took up the subject, which was his introduction to the formal theory 
of power series, published a thesis that Gudermann thought very highly of, and after 
passing the examination, moved to the Gymnasium Paulinum in Miinster. 

Here he wrote his second paper (Weierstrass 1841a), although he did not attempt 
to publish it.2 This is odd, because, as Mittag—Leffler (1923, 27) was to remark, in 
this paper. 


'See Mittag—Leffler (1923, 14-15). 
Tt was published for the first time in his Werke towards the end of his life. 
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one can find, as far as is known, the first rigorous proof of Cauchy’s theorem on the integration 
between two given complex limits without any application of double integrals or surface 
integrals; this is given in such a way that one obtains at the very same time the Laurent 
theorem, which became known only two years later through a note by Cauchy. 


It is all the more remarkable, to quote Mittag—Leffler again, when one notes that 
“Weierstrass first became acquainted with Cauchy’s works in 1842”. 

As Bottazzini was the first to observe (see Bottazzini 2002), the emphasis on 
power series methods is characteristic of Weierstrass’s approach to complex analysis 
because it allowed him to develop uniform methods that apply to functions of both 
one and several variables, and complex functions of several variables were intended 
by him to open up the study of hyper-elliptic and abelian functions. This commitment 
is apparent in his (1841b), the second paper Weierstrass wrote in 1841. In it he stated 
and proved three theorems about power series expansions (by which he meant both 
‘ordinary’ power series and Laurent series expansions). Theorems (A) and (B) show 
how to evaluate the coefficients of a Laurent series in one and resp. several complex 
variables without the use of integrals. Weierstrass did not know of Cauchy’s Turin 
1831 Mémoire on the ‘calculus of limits’, and derived the relevant inequalities for 
Taylor series independently. 

These theorems run as follows: 


Theorem (A): Let F(x) = mas: A,x” bea power series of the complex variable x 
with given coefficients and r ‘any determinate, positive magnitude which lies within 
the domain of convergence of the series’ (Weierstrass 1841b, 67). Let sup | F(x)| = g 
for |x| =r. Then, 


|Au| < gr 


for every integer value of ju. 


Weierstrass’s proof is based on his assumption that the series is uniformly convergent 
for |x| =r. 


Theorem (B) extended this result to functions F(x), ..., Xp)) of several complex vari- 
Biles Met E (aii x Rp) yy dant eaue ey Ay eh .x," be a convergent series, 
where x1, X2,..., Xp are complex variables, Ay,,..,... sy, are given constants and 
vy, € Z, and let r;(1 <i < p) bea system of positive quantities such that the point 
(x1, X2,...Xp) = (11,12, ...Tp) belongs to the domain of convergence. In addition, 
let sup |F (x1, etc | = g for |x;| = 7; for any i. Then 
—H1,—H2 PB 
Aisigsicormyp Se Ory Ry tp” 


for every system of integer values of j1;. 


The proof is similar to the first one, and is based on the assumption that the series 
for values x such that |x;| = r; for any i converges uniformly. 
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Weierstrass’s Theorem (C) is the double series theorem nowadays named after 
him: Given an infinite sequence of functions Fy,(%1,...,X,) defined by ‘ordinary’ 
power series 


yy V2 Yp 
Fils .224 0p) = > Al sy gs estiy dy Me” xcckp 
V 


such that each of the functions (and their sum) is ‘unconditionally and uniformly’ 
convergent in a determinate neighbourhood G of the origin (0, 0, ..., 0), then 


+00 

V1 v2 ud 
> Fy,(X1,-..5%p) = > Ayyiry +++ sv) Xy XQ eee 
p=0 Vy 


lL 
where 9) Ary, 125+ ++ +v = Ansys +++ su) < +00 
Le 


In other words, under the given hypotheses about convergence, the sum of the 
series of functions F,, can be expressed by a power series in x1,..., x, whose 
coefficients are given by the sum of the corresponding coefficients of the power 
series which represent the functions F;,. 

Weierstrass proved this theorem by using the Cauchy inequalities for Taylor coef- 
ficients that he had already obtained in Theorem (B). His arguments show that he 
now had a clearer understanding of the phenomenon of the uniform convergence of 
series, as far as the interchanging of two infinite series was concerned. 

The importance of these theorems, and particularly the last, is that “for Weierstrass 
the double series theorem was the key to convergence theory”, to quote (Remmert 
1989, 251). From Theorem (C) Weierstrass obtained the theorem on differentiating 
uniformly convergent series. He developed each F;, into its Taylor series around any 
point a1, ..., a, of G and observed that, as a consequence of Theorem (C), for fixed 
a\,..., 4, they converge in some fixed disk centred at a), ..., a, and lying in G. 


Therefore the series 
+00 
b> F(a, .-+5 4p) 
u=0 


represents an analytic function there, and has derivatives of any order which are 
given by 


Oa = a 
Dry es Fults Xp) = Da Fults +p), DAS Vy cegsp: 
p=0 L=0 


As with his previous papers, even this paper remained unpublished until 1894, 
when the volumes of Weierstrass’s Werke began to appear. However, Weierstrass 


3However, Weierstrass made the unnecessary hypothesis that the series converges unconditionally, 
i.e. absolutely. 
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sometimes presented Theorem (A), and even also Theorems (B) and (C), in his 
Berlin lectures on the introduction to analytic function theory.* 

A striking feature of these early papers, and it persisted throughout Weierstrass’s 
career, is his avoidance of methods that rely on the Cauchy integral theorem. 
Bottazzini has suggested that Theorem (B) suggests that Weierstrass’s drive for a 
theory of several variables may provide the reason. To be sure, Weierstrass could 
have proved Theorem (A)—which is about functions of a single variable—by intro- 
ducing an integral taken around the boundary of a disk of a suitable radius. But 
for functions of several variables case there was no way then to handle the integral 
of a function of several complex variables taken over the boundary of a polydisk. 
However, the method of power series applied with not much extra work to several 
complex variables as well as one. For this reason, Bottazzini argues, Weierstrass 
seems to have turned against integrals (‘transcendental’ methods, as he later called 
them) in favour of power series as early as 1841. 

In the spring of 1842 wrote a third paper that he also withheld from publication. 
In it he established the existence of solutions of a system of n algebraic ordinary 
differential equations, unaware of what Cauchy had done in a series of papers starting 
with (Cauchy 1842).° These three papers gave Weierstrass the foundations of all of 
his later theory of analytic functions. 

In autumn 1842 Weierstrass obtained a position as teacher at the Katholisches 
Progymnasium in Deutsch Krone in West Prussia. He had to teach a wide range of 
subjects, but in spite of this he was able to continue his research. In 1848 he got a 
position at the Gymnasium in Braunsberg. The next year he published a draft of a 
theory of Abelian functions in Programmheft of the Gymnasium—academic schools 
in Prussia were proud of the qualities of their teachers and encouraged them to do 
some sort of research. Weierstrass then developed the basic ideas in this paper in the 
two papers that appeared in 1854 and 1856 in Crelle’s Journal and made his name. 

In the first Weierstrass solved the Jacobi inversion problem in the hyper-elliptic 
case, a problem that had reduced Jacobi, on his own admission “almost to despair” 
until he had found the key in a paper of Abel’s. Even so, understanding of the 
problem remained limited until Weierstrass solved it by his completely different 
route. His 1854 paper “caused a degree of surprise all over the mathematical world, 
which remains almost unique in the history of our science” (Killing 1897, 718). 
Borchardt, the editor of Crelle’s journal, went to Braunsberg to meet the author 
of the paper. Liouville wrote a complimentary letter to Weierstrass, and published 
a French translation of his paper in the Journal de mathématiques that he edited. 
Soon thereafter Weierstrass was awarded by a doctor degree honoris causa from 
KGnigsberg University, and a delegation led by Friedrich Richelot went to Braunsberg 


*Bottazzini has considered Killing (1868, 66-68), Hettner (1874, 278-294) and Hurwitz (1988a 
99-100, 123 and 155), see Bottazzini and Gray, (Chap. 7). 


7 Fi : dx : 
5 Weierstrass considered n equations of the form ae = Gj(x1,..., Xn), for n unknown functions 


X1,...,X,, Where the functions G; are polynomial functions in x),..., Xp. 
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to hail Weierstrass as “our teacher”.° The paper (1856a) is an expanded and detailed 
version of the first one, and was also published in Crelle’s Journal. 

The publication of his 1854 paper opened the way to Weierstrass’s academic 
career. In January 1855 Crelle wrote to the Prussian Minister of Education to ask them 
to see that Weierstrass could be found an academic position in order to continue his 
research. The Ministry naturally consulted Dirichlet, who confirmed Crelle’s views 
and hopes while noting, as elder generations of mathematicians are prone to do, 
that Weierstrass “has given only partial proofs of his researches [in his 1854 paper], 
where the intermediate explanations are lacking” (in Dugac (1973, 52)). But in 1855 
Gauss died, and this provoked a round of academic hirings and promotions. Initially, 
Weierstrass was an unsuccessful candidate for a chair in Breslau, but this failure seems 
to have shaken up the Berlin mathematicians, for very soon, thanks to the joint efforts 
of Alexander von Humboldt and Richelot, Weierstrass was appointed professor at 
the Berlin Gewerbeinstitut (the Industry Institute, later Gewerbeakademie, today 
the Technische Universitat). In September 1856 he became a professor at Berlin 
University. There he joined Kummer and the privately wealthy Leopold Kronecker, 
a former student of Dirichlet’s and Kummer’s and (after 1860) a member of the Berlin 
Academy of Sciences. So by the mid-1850s the celebrated ‘triumvirate’ of Kummer, 
Kronecker and Weierstrass, which was to dominate the German mathematical milieu 
for some 25 years, had settled in the Prussian capital. For the former teacher at 
Braunsberg, at the age of 40 a new life began. 


19.3 Weierstrass in Berlin 


Weierstrass threw himself into teaching. Among his early courses was one on “The 
applications of elliptic functions to geometry and mechanics’ (Winter Semester, WS, 
1857/58). In winter 1859/60 he lectured for the first time on “Introduction to analysis’, 
and in summer 1860 he gave a course on integral calculus. Then he came back to his 
favourite subject, elliptic functions, and devoted a course to a selection of problems 
to be solved by means of these functions. In WS 1861-62 he announced a course 
on ‘The general theory of analytic functions’, but a serious breakdown caused by 
overwork forced him to stop lecturing for one academic year. Once he recovered 
he resumed teaching elliptic functions and their applications, and in summer 1863 
delivered his first course on the introduction to the theory of Abelian functions. 
Eventually, in WS 1863/64 he was able to teach the promised course on analytic 
function theory. 


A Course on Differential and Integral Calculus 


We get a glimpse of Weierstrass’s early lectures on analysis by looking at a set of 
(unpublished) lecture notes taken by Schwarz from a course on ‘Differential and 
Integral Calculus’ given by Weierstrass in summer 1861 at the Gewerbeinstitute in 


©See Killing (1897, 718). 
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Berlin. These are at times rather sloppy, and not always any better than Cauchy had 
already done. That said, the last topic Weierstrass presented was the differentiation 
of infinite series. He began by raising the question about the conditions one has to 
impose so that a series of continuous functions, each of which is convergent for all x 
within two given limits, represents a continuous function. He answered the question 


precisely, saying that the series is convergent “in equal degree”.’ 


when the sum of r terms of the series following the nth (r being an arbitrary positive whole 
number) for increasing n can be made smaller than any arbitrarily given magnitude, and this 
such that one may take the same n for all values of x and the property still holds true when 
instead of n one takes n + m, m being any positive whole number. 


Nowadays this form of convergence is called uniform convergence. Weierstrass’s 
clumsy expression is the result of using words rather than symbols, but it also reflects 
the difficulty of the concept that Weierstrass is trying to convey. 
By means of an €—n argument Weierstrass then proved rigorously that the uniform 
convergence of the series 
[o,@) 
>) n(x) 


n=1 


of continuous functions in a given interval (a, b) ensures the continuity of the sum 
of the series.® 

Weierstrass then investigated the term by term differentiability of a series of func- 
tions. He considered two series of functions >* n(x) and > y’, (x), that are contin- 
uous on a given interval and such that y/, (x) is the derivative of the corresponding 
term Y,(x) for each n. Under the hypothesis that the series converge uniformly in 
the interval, he proved that the function 


g(x) = >°¥,@) 
is indeed the derivative of the function 


v(x) = >° gn(x) 


there; that is, term by term differentiation is valid. Now Weierstrass could use the 
differential calculus to obtain the series expansion of elementary transcendent func- 
tions such as the exponential and the trigonometric functions. Finally, he determined 
the series expansion of a function given the series expansion of its derivative. 

This course exerted a great influence on Schwarz and, in turn on Eduard Heine 
who was a professor at Halle when Schwarz began teaching there in 1867. From 


7In German “Convergenz in gleichem Grade’, a term he took from Gudermann, as we discuss below, 
see § 22.4. 

8Schwarz mistakenly wrote that the theorem states a necessary and sufficient condition, but the 
proof clearly shows that it is a sufficient condition. 
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Heine himself we learn that Weierstrass’s ideas were spread in conversations with 
Weierstrass, Schwarz, and Cantor as well.” 


19.4 Weierstrass’s Programme of Lectures 


As his lecture courses developed over the years, and especially after July 1864 when 
he was appointed Ordinarius at the University, and gave up teaching at the Gewerbe- 
institut, Weierstrass became more and more deeply concerned with rigour in analysis. 
In particular, he rightly realised that the theory of analytic functions required rigor- 
ous foundations, and in his eyes these functions were to provide the foundations of 
the whole building of the theory of elliptic and Abelian functions. Poincaré caught 
this well in his obituary of Weierstrass (Poincaré 1899) when he summarised Weier- 
strass’s work in this way: 


1. To deepen the general theory of functions of one, two and several variables — this was 
the basis on which the whole pyramid should be raised. 


2. To improve the theory of transcendental and elliptic functions and to put them into a 
form which could be easily generalised to Abelian functions, the latter being a ‘natural 
extension’ of the former. 


3. Eventually, to tackle Abelian functions themselves. 


Unlike other mathematicians, who preferred papers or books, Weierstrass chose 
to realise this programme through his university lectures. Over the years one of his 
major concerns became the establishing of analysis, and the foundations of analytic 
function theory in particular, with absolute rigour. Until the end of his teaching career 
he presented a course in analysis in a cycle of lectures over four consecutive terms, 
that ran as follows: 


1. Introduction to analytic function theory 

2. Elliptic functions 

3. Abelian functions 

4. Applications of elliptic functions or, sometimes, the calculus of variations. 


Most of his original discoveries were presented first, and on occasion only, in his 
lectures, and the steadily evolving lecture cycle was often the only way to find out 
what his opinions and insights were. For example, when Weierstrass proved that a 
convergent power series is uniformly convergent within the domain of convergence, 
he thereby showed that a function defined by a power series is continuous on any 
domain strictly inside its circle of convergence, because the convergence is neces- 
sarily uniform there. This result had been previously stated by Abel but not given an 
adequate proof. But few if any could know of Weierstrass’s discovery other than by 
going to the University of Berlin to hear him developed his ideas during his many 
lecture courses. 


°See Heine (1872, 172). 
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This practise drew a large number of students to Berlin, but it was ultimately an 
unsuccessful way to promote his ideas. 


Casorati’s Notes 


The Italian mathematician Felice Casorati was so irked by this secrecy that he tried 
to visit Weierstrass in Berlin and corresponded intensely with Weierstrass’s former 
student Schwarz to find out about Weierstrass’s most recent discoveries; he hoped 
to present these in the planned second volume of his treatise, but in the end it never 
appeared. However, the historian is very fortunate in having Casorati’s notes of his 
trip to Germany.!° 

Casorati had decided to travel to Berlin, in order to understand what was going 
on in the town that was becoming the centre of mathematical world. He made notes 
of his conversations with the Berlin mathematicians in a scientific diary, and they 
provide an intriguing glimpse of some of the problems that were at the forefront of 
mathematical research at that time. He met Kronecker and Weierstrass separately, and 
occasionally together, and at times, students and other mathematicians attended the 
meetings. Kronecker, for example, remarked that the concept of continuity “was still 
a confused idea”, and the Dirichlet principle and the concept of analytic continuation 
also came up—topics that were to continue to interest Weierstrass for years to come. 

Kronecker even surprised Casorati by claiming to have examples of “functions that 
do not admit differential coefficients, that cannot represent lines etc.” Casorati’s few 
notes unfortunately do not allow us to decide whether Kronecker actually discovered 
such ‘pathological’ functions, or simply reported what Weierstrass had done on 
his own. Schering, who was at this particular meeting, mentioned Riemann’s still 
unpublished but quite well-known example of an integrable function discontinuous 
at every rational point. 

Casorati noted that already the Berlin group were becoming more and more critical 
of Riemann’s way of working: “Riemann’s things are creating difficulties in Berlin’, 
he recorded. For example, Kronecker criticised Riemann and his followers were 
for assuming that the singular points of a complex function are always such that 
any two non-singular points can always be joined by a path that avoids the singular 
points. “But this is not possible”, Weierstrass told Casorati, and “It was precisely 
while searching for the proof of the general possibility that he realised it was in 
general impossible”. Kronecker provided Casorati with this example to illustrate the 
impossibility: the function 


CO 
6o(q)=1+2>04", lal <! (19.1) 


n>1 


The function 69(q) cannot be analytically continued outside of the disk |g| < 1. 
“There is another expression for it’—Weierstrass told Casorati—‘“which also has 
meaning outside of the circle, that is, inside and outside but not on the circumference”. 


'OThese have been studied by Neuenschwander (1978a, b) and more recently in Bottazzini and 
Gray (2013, § 6.4.1). 
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The latter is “entirely made of points where the function is not defined, it can take 
any value there’. In this rather cryptic way Weierstrass announced a result he was 
to publish only in 1880.!! This was intimately connected with his example of a 
continuous nowhere differentiable function which he was to publish some years 
thereafter. !* 

In order to know the behaviour of the function defined in equation (19.1) outside 
the circumference |g| = 1, Kronecker observed that “it is necessary to resort to other 
means, and not to those of making g follow a path” joining a point within the disk 
with a point outside it. According to Casorati’s report, Weierstrass had thought for a 
long time that points at which a function “ceases to be definite [...] could not form a 
continuum, and consequently that there is at least one point P where one can always 
pass from one closed portion of the plane to any other point of it” until he realised 
that “this was not the case”. As an example of a point at which a function ceases to 
be definite, Weierstrass cited x = 0 for the function e!/*, saying that “it can have 
any possible value” there. 


Weierstrass and Schwarz 


Casorati’s notes amplify the picture of the issues that concerned Weierstrass as he 
set out to produce a rigorous theory of complex analysis adequate to the demands 
he wished to impose upon it. As the years went by, this quest took on a markedly 
algebraic cast, as a much-quoted passage from a letter he sent to Schwarz in 1875 
shows: 
The more I think about the principles of function theory — and I do it incessantly — the 
more I am convinced that this must be built on the foundations of algebraic truths, and 
that it is consequently not correct when the ‘transcendental’, to express myself briefly, is 
taken as the basis of simple and fundamental algebraic propositions. This view seems so 
attractive at first sight, in that through it Riemann was able to discover so many of the most 
important properties of algebraic functions. (It is self-evident that, as long as he is working, 
the researcher must be allowed to follow every path he wishes; it is only a matter of systematic 
foundations) (in: Weierstrass Werke II, 235). 


The ‘transcendental’ methods Weierstrass was rejecting were those involving the 
integral, and he rejected them even when dealing with functions of a single variable, 
which, as we shall see, gave his presentations a somewhat contrived feel at times. 
Also to be rejected, in his own work and, he implied, in that of his students, was 
any lasting use of geometric intuition, although it could be allowed to assist in the 
process of discovery. 

Weierstrass, of course, did not see these strictures as limitations. It was not a whim, 
or an idle prejudice, that caused him to reject methods that had served Riemann 
so well. Rather, as he explained later in the same letter to Schwarz he had been 
“especially strengthened [in this belief] by his continuous study of the theory of 


‘Tn the same paper in which he first published his M-test. 

This series had already been used by Jacobi (Fundamenta nova, § 65) in his proof of Fermat’s 
theorem on the number of representations of a number as a sum of four squares. The connections 
with ‘pathological’ functions had been made evident by Riemann in examples he presented in his 
Habilitationsschrift by considering the limit cases of certain theta-series. 
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analytic functions of several variables”, which was required to create a theory of 
Abelian functions. To this end, Weierstrass’s lectures on analytic function theory 
would conclude with the elements of a theory of functions of several variables. 


19.5 Introductions to the Theory of Analytic Functions 


We can trace the developments in Weierstrass’s theory only intermittently in his 
publications, but much more steadily in his lectures courses from WS 1861/62 to 
WS 1884/85, of which many notes taken by his various students survive. (They were 
kept, one supposes, precisely because Weierstrass was developing a systematic theory 
that could not be found anywhere else and made almost no reference to any existing 
literature.) And indeed, Weierstrass seems to have been a good lecturer, at least for the 
better students. Fifty years later Leo Koenigsberger recalled attending Weierstrass’s 
lectures on elliptic functions with Lazarus Fuchs, and wrote (Koenigsberger 1917) 
that they could never forget the “powerful impression and incomparable excitement” 
exerted on them by the “deep and over-arching content” of that lecture combined with 
the “natural, unvarnished” way of expounding it that was “free from any rhetoric”. 

In his lectures, Weierstrass regarded the fundamental property of analytic func- 
tions as the fact that they can be expanded as Taylor series. He therefore approached 
them by starting with functions defined in terms of a finite number of the simplest 
arithmetical operations (addition and multiplication) and proceeding to functions 
defined by an infinite number of such operations. 

Weierstrass agreed that analytic functions in his own sense are precisely the func- 
tions u + iv that Cauchy and Riemann had characterised as the solutions of the 
Cauchy—Riemann equations. But he said that he rejected that approach because, in 
his opinion, the general definition of a class of functions could be founded on an 
arbitrary property, whose generality cannot be established a priori. Moreover, the 
Cauchy—Riemann requires functions u(x, y) and v(x, y) that admit partial deriva- 
tives, and “in the present state of science” such functions form a class that cannot be 
exactly defined, and for this reason to the definition was rejected as being “poorly 
adapted for building the theory of analytic functions on it”.!4 

Weierstrass then defined uniform convergence for series of functions, and proved 
that the sum of a uniformly convergent series of rational functions is a continuous 
function. Only now did he introduce power series of one variable, when he was in 
a position to establish their important, basic properties. He established the a power 
series has a disk of convergence; more precisely, if zo is a point inside a domain on 
which a complex analytic function f is defined, then the function can be expanded as 
a Taylor series that converges on a disc Dp centre zg and reaches as far as the nearest 
singular point of the function to zo (that is to say, any larger disc would contain a 
singular point of f). Weierstrass also showed that the function converges uniformly 
within the disk. He produced the usual inequalities for the coefficients of a series 


13See Hurwitz’s notes (Weierstrass 1988, 49). 
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(without appealing to Cauchy integral theorem) and he remarked that “analogous 
propositions” hold for series of several variables. Next came the statement of the 
double series theorem for one and several variables, and then the proof that of the 
identity theorem (two series that agree on an (arbitrarily small) open neighbourhood 
of the origin define the same function). Finally, Weierstrass proved the usual theorems 
on term-by-term differentiation for series in one or several variables. 

In keeping with his rejection of ‘transcendental’ methods, as he called them, 
Weierstrass seldom lectured on the integral calculus, whereas the differential calculus 
is covered in almost every one of Weierstrass’s lecture notes on analytic function 
theory. The integral comes up in Killing’s notes of the course in 1868, but only at 
the end of the course, after the treatment of series, and in the lecture course of 1886. 
We can read in the letter to Schwarz cited above that when Weierstrass discussed 
Cauchy’s theorem he remarked that “nowadays, following Riemann’s example the 
proof is usually obtained from the consideration of a double integral” (the Green’s 
theorem approach), but that “I do not consider this to be completely methodical”; he 
then gave a different proof. 


Analytic Continuation 


The concept of analytic continuation was familiar to Riemann, who used it exten- 
sively, but it became of foundational importance in the work of Weierstrass. Let f(z) 
be an analytic function defined on a disc Do centre zo, and suppose that z, is a point 
in the disc Do, then the function can be expanded as a Taylor series that converges 
ona disc D, centre z; and reaches as far as the nearest singular point of the function 
to z,. It is quite possible that the disc D; extends beyond the disc Do. When this 
happens the function f is said to have been continued analytically outside the disc 
Do. The process can be repeated until the function f has been provided with Taylor 
series expansions on as large an overlapping set of discs as possible. 

There are two points to note. If a chain of overlapping discs returns to a point 
already covered, it does not mean that the original value of the function there and the 
value obtained by analytic continuation agree. For example, the square root ‘function’ 
may be defined uniquely on a disc that does not include the origin—this corresponds 
to choosing a definite sign for the square root. It can be analytically continued around 
the origin until the domain of definition overlaps itself, but it returns to the negative 
of its original value. 

Note also that there may be points where the function cannot be analytically 
continued in this fashion. They form what is called the natural boundary of the 
function, and we shall see what Weierstrass made of them shortly. 


Chapter 20 
Weierstrass’s Foundational Results 


20.1 Introduction 


Weierstrass is remembered for many specific discoveries in complex function theory, 
and here we consider two of them: his insight into the distinction between poles and 
essential singularities and the idea of a natural boundary of a complex function and 
the connection to nowhere differentiable real functions. We then look more briefly at 
his systematic presentation of the theory of elliptic functions, and content ourselves 
with a mention of his representation theorem, and his final account of a theory of 
Abelian functions. 


20.2 Poles and Essential Singularities 


Weierstrass is remembered for a number of major results in complex function theory. 
One such striking insight is the distinction he pioneered between what are called 
poles or finite singularities, and essential singularities. On the face of it, a function 
such as f(z) = 1/z? is not defined at the origin, but its reciprocal is. Also, rather 
trivially, there is an integer n such that the function z” f(z) is analytic at the origin— 
in this case n = 2 and the function z” f(z) = 1. But it is also true that the function 
g(z) = 1/z* + 5/z — 9 + 22%, say, is not defined at the origin, but its reciprocal 
is. Again there is an integer n such that the function z”g(z) is analytic at the origin: 
n = 2 and the function z”g(z) = 1+ 5z — 9z? + 2z°. This means that functions of 
this sort, although undefined at the origin, may be said to tend to infinity at the origin 
in a straight-forward way. 

Compare the function h(z) = e*. As z tends to oo along the positive real axis, this 
function tends to +00. But as z tends to oo along the negative real axis, this function 
tends to 0. If we set z = it then h(z) = e!’ = cos(t) +i sin(t) which, interestingly, 
doesn’t seem to tend to any limit at all as ¢ tends to oo. But if we restrict f to the 
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values 2k7,k =0,1,... then h(2k7i) = cos(2k7) +i sin(2k7) = 1. If we let z go 
to infinity radially along the line y = mx,sox =t, y = mt we are led to consider 
e' (cos mt +i sinmt), and if we restrict t to the values 27k/m, k = 0, 1, ... this time 
we get infinity as a limit. More interestingly, if we let z go to infinity vertically, along 
the lines x = a, y = t weare led to consider e“ (cos tf +i sin), and if we now restrict 
t to the values 2k7, k = 0, 1, ... this time we get e“ as a limit. This means that any 
positive real number can be obtained as a limit of a sequence of points tending to 
infinity. Finally, if we let z go to infinity vertically, along the lines x = a, y=t 
we are led to consider e“ (cost + i sint), and if we restrict t to the values b + 2kz, 
k =0,1,... this time we get e“ (cos b + i sind) as a limit—so any complex number 
can be obtained as a limit of a sequence of points tending to infinity. 

All of this shows that the function /(z) = e* cannot be said to have meaningful 
limit as z tends to oo, and it follows that the function e!/* cannot be said to have 
meaningful limit as z tends to 0 and so there is no integer n such that the function 
z"e!/ is analytic at the origin. 

That is what Weierstrass proved. This shows that there is a fundamental distinction 
between functions f(z) for which there is an integer n such that the function z” g(z) is 
analytic at the origin and those for which there is not. The former kind are said to have 
finite poles, or a pole of order 1 where n is the least integer with the stated property. 
The latter kind was said by Weierstrass to have an essential singularity. Weierstrass 
was also able to show in 1864 that in a neighbourhood of an essential singularity the 
function gets arbitrarily close to any arbitrary value. In this he had been preceded 
by Casorati, and the result is therefore known, correctly, as the Casorati-Weierstrass 
theorem. ! 


20.3 Natural Boundaries 


The natural boundary of a complex function is so called because it is a natural thing 
to look for. Imagine that you have a complex function f defined by a power series 
on some open disc in the complex plane, say the unit disc centred on the origin, and 
you attempt to extend the domain of definition by choosing a point ¢ € |z| < | and 
obtaining the power series expansion of f in an open disc centre ¢. You are interested 
in the case where this new disc contains points not contained in the unit disc. What 
can happen? Well, obviously, you either succeed or you fail. If you always fail, then 
there is no point ¢ € |z| < 1 that is the centre of an open disc that sticks out beyond 
the unit disc, and so the function f cannot be analytically continued beyond the unit 
disc, and the boundary of the unit disc is accordingly a natural boundary for the 
function. 


' And even more accurately as the Casorati-Sokhotskii-Weierstrass theorem, after the Russian math- 
ematician Sokhotskii who came to the same ideas at about the same time, see Bottazzini and Gray 
(2013, § 7.7.3). 
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If you succeed, if only for some ¢ € |z| < 1, then you carry on trying to push out- 
side every disc you obtain until you have obtained the maximal domain of definition 
for the function. The points on the boundary of the domain are the natural boundary 
of the function. 

There might be no natural boundary. A polynomial function in z is certainly 
defined by a power series (indeed, a polynomial) that converges everywhere. Let’s 
say you admit poles, and want to investigate whether the function is defined at z = oo 
by the method of replacing z with 1/z. Then you find that rational functions (quotients 
of one polynomial by another) also have no natural boundary. In fact, by Liouville’s 
theorem, as Riemann seems to have known, the converse is also true: if a function 
defined on the whole extended complex plane has no natural boundary, then it is a 
rational function (this includes the case of a function defined by a polynomial). 

What about functions with natural boundaries? Now it gets much harder, and we 
shall not pursue the story very far. Mathematicians, such as Weierstrass and Hermite, 
quickly discovered that the unit circle can be a natural boundary, and when it is this 
can be because every point on the unit circle is a pole for the function, or because 
a dense set of points are, and an industry developed investigating what can happen. 
Then you can start to ask about what other point sets can be natural boundaries, and 
which cannot. For example, a single point can be a natural boundary, as is the case for 
the point at infinity and the exponential function, or as the origin is for the function 
e!/: if you prefer. 

There is one topic, however, in this domain that is worth pursuing just a little 
further here, and that is the connection with real functions. 


20.4 Disjoint Domains and Non-differentiable Functions 


In 1880 Weierstrass published a paper in which he showed that a single analytic 
expression could define an analytic function on two or more disjoint domains. This, 
he was pleased to note, refuted the opinion of Riemann’s that every complex function 
could be obtained by arithmetic operations on quantities. 

Weierstrass gave this example of a uniformly convergent series of rational func- 
tions to illustrate his new result, which he said he had found and presented in his 
lectures for years (and indeed had discussed with Casorati and Kronecker, as we saw 
in (19.4) above). It was given by the series 


CO 


Fo= ayes 


v=0 


which is convergent for |x| < 1 and |x| > 1. He proved (using results Jacobi had 
established for the theta function—see Eq. (19.1)) that in each domain this series 
represents an analytic function that cannot be analytically continued across their 
common boundary and into the other domain (Weierstrass 1880, 211). The series 
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therefore represents two functions, not two branches of one function. Therefore the 
method of series can define functions that Riemann’s methods cannot. 

This gave Weierstrass the opportunity to clarify an essential point connecting real 
and complex analysis. He had always insisted in his lectures on the importance of 
two theorems that “did not coincide with the standard view”: the continuity of a 
real function does not imply its differentiability; and a complex function defined in 
a bounded domain cannot always be continued outside it. As he put it, the points 
where the function cannot be defined may be “not simply be isolated points, but may 
also form curves and surfaces” (Weierstrass 1880, 221). 

To demonstrate this point, Weierstrass introduced a series © A,,x” that is assumed 
to be absolutely and uniformly convergent for |x| < 1. By restriction to |x| = 1 one 
obtains the series XA,e”', which represents a continuous function g(t) of a real 
variable t. If the power series can be analytically continued outside the circle |x| = 1 
then there is a power series with its centre in the disc and which converges on a disc 
Dy that includes part of the unit circle (where the new disc sticks out). The new 
power series is absolutely and uniformly convergent inside D1, and in particular on 
all points of the unit circle inside Dj, and the restriction g(t) is therefore infinitely 
differentiable as a function of f on that part of the unit circle. This means that parts 
of the unit circle where the restricted function g(t) is not differentiable must form 
part of the natural boundary of the function. So if the restricted function is never 
differentiable on the unit circle, the unit circle is a natural boundary for the original 
power series. 

Bottazzini has suggested that Weierstrass’s power series approach to complex 
analysis inevitably led Weierstrass to investigate the natural boundary of a complex 
function and so to the study of non-differentiable real functions on the boundary.” 
This duly led him to the discovery of continuous nowhere differentiable functions 
via an astute use of uniform convergence. And indeed, when referring to the French 
translation of his 1880 paper, Weierstrass observed to Schwarz on March 6, 1881 
with a fine sense of understatement: 


My latest paper created more of a sensation among the French than it really deserves; people 
seem finally to realize the significance of the concept of uniform convergence. 


Weierstrass’s Continuous Nowhere Differentiable Function 


One of the highlights of Weierstrass’s work on this topic was his announcement 
(Weierstrass 1872) of a continuous nowhere differentiable function. However, he 
published neither the explicit function nor, accordingly, a proof of the correctness 
of his claim, and it was only revealed when Paul du Bois-Reymond got permission 
from Weierstrass to publish it in Du Bois-Reymond (1875, 29-31). 


Riemann too, as Dedekind reported when editing Riemann’s papers, found his examples of patho- 
logical functions looking at limit case of @-functions. But, as Klein wrote in his history of mathe- 
matics in the 19th Century (1926, I, 286), Riemann’s attitude was quite different from Weierstrass’s: 
“Riemann excluded natural boundaries from his considerations. Weierstrass, on the contrary, was 
led precisely by his systematic manner of thought to look closer at the behaviour of an analytic 
function in the neighbourhood its natural boundaries”. 
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Fig. 20.1 The first few 
terms of a Weierstrass 
function | 
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This is the function 
f(x) = Xyb" cos(a"rx), O<b<1, ab>1+4+3n/2, aan odd integer. 


Continuity follows from the fact that, by Weierstrass’s M-test with M, = b”, the 
series is uniformly convergent for all x. The large values of a” that enter the sum 
very quickly make it clear that the function will be likely to oscillate a lot, which 
hints at non-differentiability, but the oscillations could be damped out by the shrink- 
ing powers of b. In fact, establishing non-differentiability everywhere is hard, and 
is nicely discussed in Bressoud (1994, 260-262), which is not that different from 
Weierstrass’s original account.’ The condition that the integer a be odd makes the 
fact that cos(N7) = +1 easier to handle. The condition that ab > 1 + 37/2 is 
what makes the tail of the series oscillate so much that the function is nowhere 
differentiable. 

We can get an impression of this by looking at graphs of the Weierstrass function 
fora = 13, b = 1/2, where Weierstrass’s conditions are obeyed, and fora = 3, b = 
1/2, where they are not. 

Even the first five terms of Fig. 20.1 suggest that differentiability is failing every- 
where, whereas Fig. 20.2, which shows the first 15 terms in the fake, suggests that 
differentiability is only failing at some points (albeit possibly a dense set). 

To get an impression of how tricky these convergence arguments were, note that 
Riemann had claimed in his lectures in 1861 that the function 


sin(n2x) 
n n2 
fails to be differentiable for infinitely many values of x, but it was only in his (1916) 
that Hardy showed that differentiability fails for a dense set of points in any interval, 
and only in 1970 was it shown that there are infinitely many points at which the 


3See Du Bois-Reymond (1875) and Weierstrass (1872), the version in Weierstrass’s Mathematische 
Werke, which differs from the account given by du Bois-Reymond only in the correction of an simple 
error. 
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Fig. 20.2 A ‘fake’ 
Weierstrass function 


function is differentiable.* Weierstrass suggested in his lecture that even Cauchy, 
Gauss, and Dirichlet, rigorous critics as they were, had never doubted that a con- 
tinuous function was other than differentiable except at isolated points where the 
derivative was either not defined or infinite. 

In his account of Weierstrass’s function, du Bois-Reymond explained carefully 
that a function could be continuous at a point without being continuous at any nearby 
point (he gave an example too complicated to be reproduced here). It was, he went on, 
one of the deepest recent results in mathematics that a function could be continuous 
at every point of an interval and differentiable nowhere, and this was Weierstrass’s 
discovery. Mathematicians in German circles had been concerned with this issue 
for some years, led by those in Riemann’s school, because Riemann had suggested 


that the function &,, a2 had this property. It was certain that the function failed 
to be differentiable at infinitely many points in any interval, but it seemed that no- 
one in that circle had put a proof down on paper. It was clear that Hankel’s method 
of condensation of singularities allowed one to construct continuous functions that 
failed to be differentiable at a dense set of points, but Weierstrass’s example deserved 
to be widely known because of its great simplicity and naturalness. 

Weierstrass also asked du Bois-Reymond to send a copy of the memoir to Gaston 
Darboux in France, because it was apparent that he was interested in the topic but 
unaware of what had been done.» 


4Weierstrass is our source for this information about Riemann, and interestingly he was of the 
opinion that differentiability fails everywhere for Riemann’s function but is “somewhat difficult to 
demonstrate”; see Weierstrass (1872, 72). 


5For more detail, see Schubring (2012, 570-575) and Gispert (1983, 49). 
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Gaston Darboux (1842-1917) 


Indeed, Darboux, in his memoir (1875) also gave an example of a function that 
is continuous everywhere and differentiable nowhere. This is the function 


ee) 


f= > smn 1)lx 


n=1 


It is worth pausing to note that every term in the summation is analytic and therefore 
infinitely differentiable, which tends to emphasise the fact that convergence and 
differentiability do not go easily together. Note also that Darboux’s function is a 
trigonometric series—can it be a Fourier series? 

The next two graphs show the sum of the first 4 and the first 6 terms of the series, 
and give a good impression of how crinkly the function is getting (Figs. 20.3 and 
20.4). 


Fig. 20.3. The first 4 terms 
of Darboux’s series 
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Fig. 20.4 The first 6 terms 


of Darboux’s series Me 


0.44 


0.2 


It would seem that Darboux had heard of Weierstrass’s announcement of 1872, 
because he mentioned it in a letter to his friend Houél that year, but had come to his 
own example of an everywhere continuous, nowhere differentiable function. 


20.5 Advanced Results 


It would take us too far afield to follow Weierstrass’s treatment of many topics here, 
but it is important to give an impression of what he accomplished that made so many 
mathematicians of his time regard him as the leading complex analyst, and one of 
the people most responsible for making complex analysis one of the most important 
branches of mathematics. A number of the more vital achievements will be listed. 


Elliptic Functions 


Weierstrass became the author of the definitive account for its day, although it is no 
longer regarded as the fundamental approach. His approach was to characterise the 
elliptic functions among the functions satisfying an algebraic addition theorem (a 
formula connecting the values of a function at vu, v and (u + v)). We can notice how 
algebraic, and ungeometrical this is, and also that the book that eventually appeared 
on it was a large and successful of formulae and results needed by anyone working 
with elliptic functions.® It had no historical remarks, and no technical motivations— 
in this it resembled the lecture courses that were the only way Weierstrass liked for 
the dissemination of his ideas. 

That said, in his lectures in 1874 rewrote the theory of elliptic functions in a lasting 
way. He denoted the fundamental periods 2w and 2w’ and introduced the function o 


defined by 
1 2 
ow) =u J] (i “) exp (# +54). (20.1) 
it w w w 


6The Formeln und Lehrséitze (or, Formulae and Results) of 1885. 


20.5 Advanced Results 215 


where w = 2uw+2vw", and yu, v integers. He then defined the function 9 (u), which 
“is one of the most important [functions] of the whole analysis”, as 


7 a =i 1 1 ga 
pu) =—T5 seo = a+ > (Gap): (20.2) 


where w = 2yiw + 2vw’, as above.’ g (u) is doubly periodic function with exactly 
one infinite point of order two in the period parallelogram. 

Weierstrass then deduced the crucial properties of his > function and of elliptic 
functions in general. For example, “Every doubly periodic function can be repre- 
sented in the form 

o(u—Uu,)...0(u—U,) 


puyse o(u —v1)...0(u — vy) 203) 


where the u’s and v’s can be chosen in such a way to build two complete systems of 
non-congruent magnitudes (i.e. the difference of any two of them is not a (multiple 
of a) period) that satisfy the condition 5° u, = >¢ vy,” (Theorem 13). Such a function 
is doubly periodic, and it is called an elliptic function of degree v, and “there are 
only elliptic functions of degree 2 and above”. 

Moreover, he showed that so (u + v) and go (u — v) are algebraically expressible 
in terms of £9 (uw), go (v) and their derivatives, and that so (u) = s is a solution of the 
differential equation 


ie 3 
aa = 45° — qos — 93, (20.4) 


where g> = 60 >) (+-)* and g3 = 140 > (54,)°. 

He then proved that if g2 and g3 are chosen arbitrarily a suitable pair of periods 
w and w’ be found the corresponding g (uw) function satisfies equation (20.4). To do 
so he introduced the quantities e), e2, e3 such that 


! (u)* = 4(9 (u) — e1)( (u) — €2)( W) — 3), 


and established the necessary properties. This is the definitive answer that links 
elliptic functions and elliptic integrals and shows that there is an elliptic function 
for every possible set of periods. He concluded his lectures by discussing the cases 
93 — 2793 # 0, and g3 — 2793 = 0. 

Weierstrass’s Representation Theorems 

The famous representation theorem, announced in letters at the end of 1874, showed 
that an arbitrary point set in the plane of complex numbers can be the zero set of 


an analytic function if and only if the set has no accumulation point. Weierstrass’s 
proof explicitly exhibited the required function, and the form of the function allowed 


7The order is reversed in the Formeln und Lehrséitze. 
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mathematicians to raise, and answer other questions of this type. For example, his 
student Mittag—Leffler showed in 1867 that one can prescribe the set of poles of a 
complex function in the same way. 


Abelian Functions 


All of Weierstrass’s work may be said to circle round the topic of abelian functions. 
This accounts for his algebraic preferences and for his interest in developing a theory 
of complex function in several variables. Despite the early successes which made 
his name and got him to Berlin, it is fair to say that he never got more than glimpses 
of how to proceed, and indeed made one or two significant mistakes in his attempts 
to go from one to many variables. We can only note here that this difficult topic was 
the promised land, not to be entered until after Weierstrass’s death. 


Chapter 21 
Revision—and Assessment 


At this point in the course we have seen a lot of mathematics emerge: the existence 
of harmonic functions satisfying certain boundary conditions, trigonometric func- 
tions, and complex analytic functions (which, in Riemann’s hands but not those of 
Weierstrass, have intimate links to harmonic functions). But it is also true that very 
little of this was solidly established at the time: Dirichlet’s principle was beginning 
to look too insubstantial to solve Dirichlet’s problem, and with it could fall much of 
Riemann’s complex analysis. The study of trigonometric series might open the way 
to new results about real functions, but intuition seemed to offer little guidance. 

One way to take stock is to grapple more closely with the sources, and five seem 
appropriate: 


1. Green on potential theory (his ‘An Essay on the application of mathematical 
analysis to the theories of electricity and magnetism’) 

2. Dirichlet on the theory of Fourier series (his “Sur la convergence des séries 

trigonométriques’ ) 

. Schwarz on conformal mapping (his ‘Ueber einige Abbildungsaufgaben’, 1869) 

4. Riemann on the Cauchy—Riemann equations (his (1851, Sects.1 to 4)) and 
Maxwell on conjugate functions (his “Theory of conjugate functions in two dimen- 
sions’ ) 

5. Stokes on uniform convergence (his ‘On The Critical Values Of The Sums Of 
Periodic Series’, 1847). 


WwW 


Green’s essay on potential theory is reprinted in his Mathematical Papers 
(pp. 356-374), which is available in several places on the web (consult the Digi- 
tal Mathematics Library and the Internet Archive). A translation of Dirichlet’s paper 
on the theory of Fourier series is given in Appendix A.2 at the end of this book, 
as is this particular paper by Schwarz on conformal mapping in Appendix A.5, and 
Riemann’s approach to the Cauchy—Riemann equations in Appendix A.3. Maxwell’s 
Treatise on Electricity and Magnetism 3rd ed. Vol. I, Chap. 12, §§ 182-191, pp. 284— 
294, and Stokes on uniform convergence—a paper reprinted in his Mathematical and 
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physical papers, Vol. 1, 1880, 236-313 are also available in the web in the Internet 
Archive. Stokes’s paper is also discussed below, in § 22.5. 

When I taught the course, I advised the students to read all the extracts quickly 
over once and find (at least) one they wanted to proceed with. None of them are 
altogether easy to read, but for a first approach it is possible to let the obscure bits 
wash over you and wait for something more comprehensible to turn up. Sometimes 
a difficult paragraph has an easier second part, or is followed by an easier one. The 
aim is to form some sense of what the extract is about. 

Terminology can be a problem. This is not simply a matter of old words for shared 
ideas, but words for ideas that may be muddled or obscure and in need of refinement. 
A dialogue with the text and your own understanding of the material is required. 
But it is perhaps more important to know what, in a short essay, to include and what 
to leave out. The aim is for you to walk away with a good understanding of what 
the authors did, so that you can state what were the key stages in their arguments, 
what were routine, what was difficult, what was done well and what, if anything, was 
left unclear. The papers by Dirichlet and Schwarz are the most rigorous, and repay 
careful study. At the other extreme, Green’s essay is the most impressionistic, and 
the challenge is to see precisely how he conveyed insight when he could not provide 
rigour. See if you can say precisely where, and in what ways, he appealed to intuition 
to bolster his arguments. 


Chapter 22 
Uniform Convergence 


22.1 Introduction 


Dirichlet’s papers proved rigorously and beyond dispute what Abel had already 
shown by an example: Fourier series can represent discontinuous functions. This 
established that there is an entire class of convergent series that are composed of 
continuous functions, but for which the sum was not continuous and that consequently 
contradicted Cauchy’s theorem on the continuity of the sum of a series of continuous 
functions. But the fact that a statement has been refuted does not mean that it will 
be clear where the incriminating point lies. It is possible that an insight sheds some 
light, certainly, but leaves other parts of the problem shrouded in obscurity. It is even 
possible that an insight can make everything more confusing. After all, Cauchy’s 
false ‘theorem’ made a clear simple statement. 

In fact, it is not clear what the implications for Dirichlet’s work were taken to be 
at the time, and Dirichlet did not comment on the contradiction that existed between 
the proper demonstration of the convergence of Fourier series and Cauchy’s theorem. 
He seems to have thought that refuting Cauchy’s false proof of the convergence of 
Fourier series was enough, and no one in the 1830s seems to have had any better 
idea of what is involved in establishing the continuity of a convergent series of 
continuous functions than Cauchy’s in the Cours d’analyse. Not until in the 1840s 
did there begin to emerge, with difficulty and in differing contexts, other ways of 
considering the continuity and convergence of a series of functions.! 


‘Chapter closely follows Bottazzini § 5.4. The Uniform Convergence of Series, but draws on later 
historical literature, notably Brating (2007) and Viertel (2014). A brief summary of the theorems 
relating to the properties of convergent sequences of integrable, continuous, or differentiable func- 
tions is provided in Appendix B. 


© Springer International Publishing Switzerland 2015 219 
J. Gray, The Real and the Complex: A History of Analysis in the 19th Century, 
Springer Undergraduate Mathematics Series, DOI 10.1007/978-3-3 19-237 15-2_22 


220 22 Uniform Convergence 


22.2 Seidel 


The first person to explicitly denounce the patent contradiction in which the theory 
of series found itself was Philipp Ludwig von Seidel, a former student of Dirichlet 
at Berlin who later studied under Jacobi and Franz Neumann at Konigsberg. In 1847 
he published an article devoted to series that represent discontinuous functions, and 
there he wrote: 


One finds in Cauchy’s Cours d’analyse algébrique ... a theorem which states that the sum of 
a convergent series whose individual members are functions of a quantity x and continuous 
in the vicinity of a particular value of x, is likewise always a continuous function of the same 
quantity in this neighbourhood. It follows from this that series of this kind are not adapted 
to represent discontinuous functions in the vicinity of points where their values jump. 


After briefly outlining the path of Cauchy’s demonstration, Seidel continued: 


Nevertheless, the theorem stands in contradiction to what Dirichlet has shown, that, for 
example, Fourier series also always converge if one forces them to represent discontinuous 
functions — in fact, the discontinuity will frequently be found in the form of those series 
whose individual members are still continuous functions. [...] When one begins from the 
certainty thus obtained that the proposition cannot be generally valid, then its proof must 
basically lie in some still hidden supposition. When this is subjected to a precise analysis, 
then it is not difficult to discover the hidden hypothesis. One can then reason backwards that 
this cannot occur with series that represent discontinuous functions.* 


The theorem that Seidel found to characterize these series was the following’: 

If one has a convergent series which represents a discontinuous function of a 
quantity x, whose individual members are continuous functions, then one must be 
able to give values of x in the immediate neighbourhood of the point where the 
function jumps for which the series converges arbitrarily slowly. 

An outline of Seidel’s demonstration can be sketched as follows. If s(x) is the 
sum of the series and s,,(x) and r,;,(x) are the sums of the first n terms of the series 
and the remainder respectively, then 


S(X) = Sn(x) + rn (*) 
When an increment of 6 is added to x, we have 
s(x +4) — s(x) = [sn(x + 6) = Sn(x)] + [rn + 9) — rn]. 
This equation will decide the continuity or discontinuity of s(x) in the neighbourhood 
of x. Since, in order for s(x) to be continuous, |s;)(x + 6) — s,)(x)| must be < 7, the 


entire question is reduced to studying the behaviour of |r, (x + 6) — ry, (x)|. Now, in 
order for the series to converge, both r;,(x + 6) and r;,(x) must be smaller than an 


2See Seidel, (1847, 35), quoted in Bottazzini, The Higher Calculus, 202. 
3See Seidel (1847, 36-37), quoted in Bottazzini, The Higher Calculus, 202-203. 
4See Seidel (1847, 37), quoted in Bottazzini, The Higher Calculus, 203. 
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arbitrary small p! and p respectively, when n is suitably large. If the function s(x) is 
continuous, then for positive ¢ smaller than p! and p, one has |rp(x +6) —rp(x)| < 
for n > no(e). And if, as 6 tends to zero, there exists an integer N that is the largest 
of the successively determined no such that, forn > N 


Is(x +6) —s(x)|<T+p+p! (22.1) 


then this inequality says precisely that the limit function s(x) is continuous. 

If, however, the number no increases beyond every finite value when 6, begin- 
ning from an initial value 7, tends to zero, then Eq. (22.1) ceases to be true and the 
convergence of the series in the neighbourhood of x becomes arbitrarily slow. This 
led Seidel to ask whether the continuity of the sum of the series by itself implies 
uniform convergence. It was thirty years before the question was answered in the 
negative, in the form of suitable counterexamples constructed by Darboux (1875), 
Du Bois-Reymond (1876), and Cantor (1880). These will be discussed in the next 
chapter. 

Viertel (2012, 110) has made the interesting observation that Seidel mentions 
neither Abel’s nor Dirksen’s doubts about Cauchy’s proof; it seems that it may well 
have been a question that Dirichlet thought it worthwhile for his former student to 
attack. 


22.3 Bjéorling 


The Swedish mathematician Emanuel Gabriel Bjérling published a paper in Latin 
in 1846 on the convergence of series that he then reworked in 1853 in Swedish, 
when he took the opportunity to claim that his result was equivalent to Cauchy’s in 
his (1853) and was therefore correct. It was argued in Grattan-Guinness (1986) that 
Cauchy must have read the Latin version by 1852 at the latest, which gives Bjérling’s 
paper an extra level of interest, and so we briefly consider it here, following (Brating 
2007), which should be consulted for the details, including an English translation of 
Bjorling’s Swedish article. 

Bj6érling’s theorem says® 


If a series of real-valued terms f|(x), fo(x), f3(x),... is convergent for every value of x 
from xo up to X, and in addition its particular terms are continuous functions of x between 
the given limits; then the sum f| (x) + fo(x) + f3(*) +--+ necessarily has to be a continuous 
function of x between the given limits. 


>Grattan-Guinness argued on the basis of similarities between the two papers including the citations 
and the fact that Bjorling had also published some results on convergence teats in Liouville’s Journal 
that Cauchy would surely have seen. It is no argument against this view that Cauchy did not mention 
Bjorling by name in his (1853). 

See Bjérling (1846, 21) quoted in Brating (2007, 520). 
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The obvious problem with this is that Abel’s exception, the Fourier series 
(-1)e1E sin(kx) would still seem to be counter-example, so everything hangs 
on how Bjorling interpreted the statement when he came to prove it. If we fix a value 
of x and ask that the limit function be continuous at x the statement is false. If we 
ask that the series converge for all x then the corresponding statement in epsilons 
and deltas can be made to look like a claim of uniform convergence. Brating’s view 
is that Bjorling was alert to the difficulties but insufficiently precise. She argues that 
Bjorling intended that the difference between a condition holding “for every value of 
x” and “for every given value of x” is that the former expression holds for x-values 
indefinitely close to a particular point and that the latter expression does not. In other 
words, a condition that holds for all x means that it holds on some interval around 
any given x. 

There has been an extensive discussion of whether Bjérling possessed the concept 
of uniform convergence, which can be traced through Brating’s paper. Her view is 
that this imposes modern conceptions on his work and that an ultimately unclear 
sense of what it is for a function to be continuous might have been at play that was 
more typical of the period, and we can note that this contributes to the argument that 
the « — 6 — N formalism was needed in delicate matters like this one. 


22.4 Gudermann and Weierstrass 


By the time Seidel’s paper appeared, other mathematicians were already picking out 
the key issue of what is today called the uniform convergence of a series of functions. 
Among them was Weierstrass, who was probably inspired by the article Gudermann 
published in the 1838 issue of Crelle’s Journal. In it, Gudermann hinted the uniform 
convergence of certain infinite series that give the expansion of elliptic functions for 
the first time. As he put it, “It is a fact worth noting that ...the series just found have 
all the same convergence rate”.’ Weierstrass was more precise®: 


Because the power series under consideration ...converges uniformly, given an arbitrary 
positive quantity 5, a finite number of terms of the series can be discarded so that the sum of 
all the remaining terms is, for every value in the specified domain ...in absolute value < 6’. 


In his lectures Weierstrass gave the now-classic definition of uniform convergence 
in an interval: 


Definition 1 (U;) The series Xu, (x) is said to be uniformly convergent in an interval 
[a, b] when, for every arbitrarily small positive e, there exists an no(€) such that 
I‘n(x)| < € forn > no and for every x,a <x <b. 


7See Gudermann (1838, 251-252). However, as Viertel has noted (2012, 142-144), Gudermann 
seems to have used this and other terms about the nature of convergence impressionistically and 
without defining them; other terms refer to the rate of convergence, which may well have meant the 
number of terms n that must be considered before a sequence is within a given ¢ of its limit. 


8See Weierstrass (1841b, 68-69). 
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As Bottazzini has remarked, “this is not the sense in which Seidel had introduced 
his notion. Seidel had not been interested in the ‘global’ properties characterizing the 
convergence of a series in an entire interval, but rather in the small but predetermined 
neighbourhood of a point.” That concept is captured by this definition: 


Definition 2 (U2) The series Xu, (x) is uniformly convergent in the neighbourhood 
of a point ¢ of the interval [a, b] if there exists a 6(¢) such that |r, (x)| < ¢ for every 
arbitrarily small positive ¢, for n > no(¢, €) and for |x — ¢| < 6. 


To quote Bottazzini again: “The uniform convergence of a series in an interval 
as defined by (U;) naturally implies uniform convergence in the neighbourhood of 
every point in the interval. The converse is also true, but the demonstration is not 
easy and was given by Weierstrass for the first time in 1880.° 


22.5 Stokes 


A contemporary of Seidel, but one as ignorant of his work as he was of Weierstrass’s, 
was a young mathematical physicist who was investigating the question of the mode 
of convergence of series, the Irishman George Stokes. Stokes was one of the first 
to establish Cambridge as a leading centre in the world for applied mathematics, a 
topic the utilitarian Victorians were always much more willing to support than pure 
research. !° He was familiar with the work of French mathematicians such as Cauchy 
and Poisson, but was almost completely ignorant of what Dirichlet had done, so his 
account of the convergence of Fourier series followed Poisson’s account instead. 
He therefore approached the study of series using Cauchy’s false ‘theorem’ on the 
continuity of the sum of a series of continuous functions, which he took not from the 
Cours d’analyse but rather from Moigno’s Lecons de calcul différentiel et de calcul 
intégral of 1840-1844, which in turn was based on Cauchy’s lectures. However, 
Stokes was cautious where Cauchy had been rash. He used Cauchy’s definition of 
the continuity of a function and adopted his distinction between ‘convergent’ and 
‘divergent’ series, but introduced an additional distinction between ‘essentially’ and 
‘accidentally’ convergent series. This corresponds to the modern distinction between 
absolute and conditional convergence, also introduced at the same time by Dirichlet. 

Stokes set out to prove the convergence of the Fourier sine series of a class of 
functions, although what class it could be exactly Stokes never made clear. He first 
assumed that the Fourier series of a function f converges absolutely (what he calls 
essential convergence). He then used the method of Abel summation to show that 
the series converges to the function it is supposed to represent. This involved him in 
finding a second series that depends on a parameter g and converges for 0 < g < 1, 


°See Weierstrass, ‘Zur Functionenlehre’, (1880). Today the result is obtained by applying the 
Heine—Borel theorem. 

'0He was also a descendant of the tradition begun by Peacock and Babbage, as can be seen by his 
use of the first difference operator, A. 
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with the further property that if this second series converges as g tends to 1 from 
below then the first series will converge correctly. 

His argument is delicate and worth describing. To get the sum of the Fourier series, 
he used a standard trigonometric identity, 2 sin A sin B = cos(A — B) —cos(A+ B). 
To show that the summed g-series agrees in the limit as g tends to | with the function 
where the function is continuous, and takes the appropriate average value at the 
points where the function has a finite jump discontinuity, he argued as follows. Fix 
attention on a point x in the interval of integration [0, a]. In any range of the dummy 
variable x’ not including x, the limiting value of the integral is 0. When the range 
of integration, however small, includes x the limit as g tends to | of the integral is 
improper and a subtle analysis yields the conclusion. Here Stokes, without saying 
so, made further assumptions about f that shows he knew what continuity at a point 
means (his verbal definition of continuity earlier is not clear)—or, if you are less 
generous, that he did not know what a merely continuous function could look like, 
and supposed it to be locally monotone. 

The next paragraph shows that the original Fourier series does in fact converge, 
so the Abel limit as g tends to | is equal to this limit, and therefore the origi- 
nal series converges as the g-series does: pointwise to the function except at jump 
discontinuities. 

The rest of the paper is less exciting, but it does at least show that Stokes knew 
that term by term differentiation of a convergent Fourier series is not generally valid. 

The most original point in Stokes’s work lies in his study of Fourier series in the 
neighbourhood of a point of discontinuity of the function. It is the same problem as 
Seidel’s but Stokes followed a completely different approach. In his words, he dealt 
with the following question: 


Let 

Uy tugt-esstuyte:: (22.2) 
be a convergent infinite series having U for its sum. Let 

vI Ug tes + Uy, +-:- (22.3) 


be another infinite series of which the general term v, is a function of the positive variable 
h, and becomes equal to uv, when h vanishes. Suppose that for a sufficiently small value of h 
and all inferior values the series (22.3) is convergent, and has V for its sum. It might at first 
sight be suEpased that the limit of V for h = 0 was necessarily equal to U. This however is 
not true. 


This is the problem that Abel had treated for the special case of power series—a 
double limiting process. As Stokes went on to show, the question becomes that of 
knowing when we can exchange the order of two limits, and indeed we cannot always 
do this. 

After a general discussion of the issue, Stokes stated and proved the following 
theorem: 


'I See Stokes (1849, 279), quoted in Bottazzini, The Higher Calculus, 205. 
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The limit of V can never differ from U unless the convergency of the series (22.3) becomes 
infinitely slow when h vanishes. The convergency of the series is here said to become 
infinitely slow when, if n be the number of terms which must be taken in order to render the 
sum of the neglected terms numerically less than a given quantity e which may be as small 
as we please, n increases beyond all limit as h decreases beyond all limit.!* 


The English mathematician who successfully brought modern analysis to the 
country (as late as the early 20th Century!) was G.H. Hardy and he shrewdly observed 
that there is a clear distinction between what Stokes meant by an “infinitely slow” 
convergence and Seidel’s “arbitrarily slow” convergence. 


Stokes is considering an inequality satisfied for a special value of n, or at most an infinite 
sequence of values of n, and not necessarily for all values of n froma certain point onwards. !3 


In modern terminology, Stokes introduced quasi-uniform convergence that is defined 
as follows: A series Xu, (x) is quasi-uniformly convergent in the neighbourhood of 
a point ¢ in an interval [a, b] if for every arbitrarily small positive ¢, there exists a 
6(¢) > 0 such that |rp(x)| < 6 for every N and no. 

In relation to Stokes research, it is also important to introduce the definition of 
quasi-uniform convergence at a point x = ¢. This is written like the definition in 
the neighbourhood of a point, with the essential difference that ¢ depends on the 
preceding choice of ¢, ¢, and WN. In his (1878, 107-108) the Italian mathematician 
Ulisse Dini demonstrated that a necessary and sufficient condition for s(x) to be 
continuous at x = (, is that the series be quasi-uniformly convergent at the point ¢. 


22.6 Summary 


It is interesting that it took more than 20 years for Cauchy’s false theorem in his Cours 
d’analyse to begin to be clarified. Abel’s counterexample did not stir a debate— 
it did not even stir Cauchy—and Dirichlet was silent on the subject. Only in the 
1840s did four mathematicians, all unaware of each other’s work and acting with 
different motives and objectives, produce new arguments, and they were all different. 
Bjorling’s position remains ambiguous. Weierstrass introduced the idea of uniform 
convergence in an interval; Seidel that of uniform convergence in the neighbourhood 
of a point; Stokes that of quasi-uniform convergence in the neighbourhood of a 
point. Common to all four is the desire to surmount the idea of simple convergence 
as defined by Cauchy in order to ensure that the (new style) convergence of a series 
of continuous functions guarantees that the limit is itself continuous. 

As we noted before, Cauchy returned to his own theorem only in a note in the 
Comptes rendus for 1853, when he observed simply that it is verifiable for a power 
series, but that it required restrictions “for other series”, and gave the example that 
Abel had earlier given. But Cauchy mentioned neither the Norwegian mathematician 


2See Stokes (1849, 281), quoted in Bottazzini, The Higher Calculus, 206. 
3See Hardy (1918, 155), quoted in Bottazzini, The Higher Calculus, 206. 


226 22 Uniform Convergence 


nor the more recent work of Seidel and Stokes. Instead, as we saw in § 4.3, he tried to 
play down the problem (and with it, the size of his mistake), saying (Cauchy 1853, 
31-32) “it is easy to see how one can modify the statement of the theorem so that 
it will no longer have any exception. This is what I am going to explain in a few 
words”. His new theorem (Cauchy 1853, 33) said, correctly: 


Theorem (of uniform convergence). If the different terms of the series Xu,,(x) are functions 
of a real variable x, continuous with respect to this variable within the given limits; and if, 
in addition, the sum uy, + Un+1 +--+ + uy always becomes infinitely small for infinitely 
large values of the whole numbers n and n’ > n, then the series will be convergent and the 
sum of the series will be, within the given limits, a continuous function of the variable x. 


The language of infinites and infinitesimals that Cauchy used here seemed ever 
more inadequate to treat the sophisticated and complex questions then being posed 
by analysis. Furthermore, when Cauchy wrote this note in 1853, France was even 
less capable of preserving its leadership in research mathematics, as Lamé pointed 
out that very year in an alarming report to the Académie des sciences. He exhorted 
mathematicians to do ‘pure’ research, lamenting that the almost exclusive interest in 
applied questions impeded the development of mathematics. He thereby denounced 
the limitations inherent in the polytechnic tradition. 

The problems posed by the study of nature, such as those Fourier had faced, now 
reappeared everywhere in the most delicate questions of ‘pure’ analysis and neces- 
sarily led to the elaboration of techniques of inquiry considerably more refined than 
those that had served French mathematicians at the beginning of the century. Infin- 
itesimals were to disappear from mathematical practice in the face of Weierstrass’s 
€—6 notation, and even the analysis of functions of complex variables, which Cauchy 
had initiated, was to be reformulated on the basis of the works of Riemann and the 
lectures of Weierstrass. 


Chapter 23 
Integration and Trigonometric Series 


23.1 Introduction 


Riemann’s previously unpublished Habilitation essay on the integration of trigono- 
metric series was published for the first time in 1867, in an issue of the Gottingen 
Nachrichten that came out shortly after his death and carried several of his papers 
that he had left in almost a fit state to print. By then his reputation as a remarkable, 
but difficult, mathematician had spread, and what he wrote was read. This chapter 
considers some of the responses to his ideas about real functions and looks at the 
changes wrought in the relationship between continuity and differentiability, and 
between the implications of convergence and uniform convergence. This will lead 
us to look at the world of functions defined by series that do not converge uniformly, 
and were often considered in the late 19th century to be the most general kind of 
function, which is why they were often called ‘assumptionless functions’. 


23.2 Heine, Cantor, and Nowhere Dense Sets 


Among the first to respond was Eduard Heine. His paper (1870) on trigonometric 
series was animated by the following question: Are the coefficients a, and b, in a 
Fourier series unique if the series is not uniformly convergent, and so term by term 
integration is illegitimate? Heine noted that Dirichlet had proved that the Fourier 
series of a piecewise monotonic continuous function converges uniformly. The con- 
vergence is necessarily non-uniform at jump discontinuities where f(x+) 4 f(x—) 
(a series of continuous functions that converges uniformly converges to a continuous 
function). Heine now observed that the Fourier series converges uniformly in gen- 
eral, as he called it, meaning that it converges uniformly on any subinterval omitting 
a specific finite set of points. 
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Inspired by Georg Cantor, a colleague of his at the University of Halle, Heine 
claimed that the Fourier series representation of a function is unique if the conver- 
gence of the Fourier series is uniform in general. To prove this, he argued as follows. 
If the Fourier coefficients are not unique, consider two Fourier series representing 
the same function. The difference of these two series represents the zero function, so 
the theorem will be proved if it can be shown that all the coefficients of the Fourier 
series formed by taking this difference are 0. In this, he succeeded; that is, Heine 
established that if a function is represented by a Fourier series that is uniform in gen- 
eral with respect to a finite set then the Fourier series is unique, and its coefficients 
are determined by the function. 

Cantor (see Cantor 1872) then developed an approach that dropped the uniform 
in general condition, and aimed at showing uniqueness for any set of discontinuities. 
First Cantor showed that given a trigonometric series > aj sin jx+b; cos jx that 
is convergent on an interval, the coefficients a; and b; tends to zero as j — oo. This 
enabled him to show that if a function defined on an interval is represented by two 
trigonometric series, pe ‘9 @j Sin jx + bj cos jx and pare —9 4; sin jx + bi cos jx, 
that converge and agree Serv where on the interval, then the coefficients of the two 
trigonometric series are the same: aj = a’, j and bj = = bi. Put another way, the only 
everywhere convergent trigonometric series that represents the zero function on an 
interval is the one with all coefficients zero. 

Cantor then investigated what could be said if a function was represented by 
two trigonometric series, >’ aj sin jx + bj cos jx and >° a’ a; sin jx + bi cos jx, that 
converge and agree sveatiene on an jnterval I except ‘at a set P white either 
convergence fails for at least one of them or the two trigonometric series take different 
values. He began with sets P that have only finitely many limit points, or, calling 
the set of limit points of a set P the derived set of P and denoting it P’, sets whose 
derived set is finite. He called these sets of type 1. Sets of higher type he defined 
inductively. A set of type 2 is one whose derived set is of type 1. If P is a set, then 
P’ is its derived set, and the derived set of P’ is denoted P). So P is of type 2 if 
P) is finite. A set of nth type is one for which P™ is finite. Interestingly enough, 
Cantor had almost no examples of the sets he was talking about, and noted only that 
the set 1/n :n = 1, 2,...18 of type 1, and that the set of all the rational numbers in 
[0, 1] is of no type. 

Cantor showed that the representation is unique if convergence holds except on a 
set of type n. His proof was a modification of Heine’s, extended by induction to any 
integer n. 

Now that we know why these sets matter, it will be convenient to notice some 
things about them. A set of type | has a finite derived set, that is, a finite set of points 
p’ each with the property that every neighbourhood of p’ contains infinitely many 
points of P. Note that the points p’ need not belong to P. However, the points of P 
somehow bunch up or cluster around the points in P’. Let me introduce a term that 
Cantor was only going to introduce later: a set will be said to be nowhere dense if 
its closure has an empty interior. It follows at once that a set of type 1 is nowhere 
dense, and indeed that a set of type n is nowhere dense. 
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23.3 Assumptionless Functions 


In the 1870s a number of mostly German mathematicians started to pick up on the 
idea that one of the things Riemann had done with his paper on trigonometric series 
was point towards a class of functions with no specific properties at all. They were 
not, for example, analytic, or differentiable, or even, necessarily, continuous. They 
were simply functions, in the sense that given a point in their domain (as it might be, 
the unit interval) they took a value at that point. As matters stood, there was nothing 
that could be said about them. 

That being the case, most mathematicians tried to delineate a class that could 
be discussed, marginalising, at least for the moment, those that defied analysis. For 
example, series that were uniformly convergent could be discussed, series that were 
not lay beyond the pale. Functions that were integrable were admitted, functions 
that were not were excluded. This, of course, invited discussion of what it is for 
a function to be integrable, and the feeling of du Bois-Reymond and many other 
mathematicians was that Riemann’s idea was the best possible: if a function was not 
Riemann-integrable it was not going to be integrable on any other conception of the 
integral worth having. 


Hermann Hankel 


In 1870 the mathematician Hermann Hankel published his dissertation at he Univer- 
sity of Tiibingen. After his early death in 1873 at the age of 34 it was republished 
in 1882in volume 20 of the Mathematische Annalen, because it was recognised 
as an important contribution to the study of assumptionless functions. In it Hankel 
presented an important technique for constructing functions that are far from being 
continuous, an analysis of at least some of the ways such functions can behave, and a 
theorem about the integrability of some, but not all, of these functions. That his main 
theorem turned out to be incorrect is further evidence, for us, of just how difficult 
the fundamental ideas of real analysis were proving to be. 

Hankel had been a student of Riemann’s in 1860 before going on to Berlin in 1861 
to study under Weierstrass and Kronecker. His analysis was motivated by a desire to 
go, as Riemann had done, beyond the safe class of functions studied by Dirichlet. 
That meant dispensing with one or both of the requirements that the function by 
piecewise continuous and piecewise monotonic. 

First, discontinuity. If a function f defined on an interval J is not continuous at 
the point a € J then it must be the case that for some ¢ > 0 there is no 6 > O such 
that |x —a| < 6 => |f(x) — f(@| < e, and that means there is at least one x in 
any interval containing x9 with the property that | f(x) — f(a)| < ¢. Such an ¢ is 
called a jump of the function f at xo, and the least upper bound, denoted o of all 
the jumps of f at xq Hankel called the jump of f at xo. I shall write it J¢ (xo). It 
will also be convenient to define Sf(o) = {a € I: J¢(a) > o}. Hankel did not 
define these symbols, which may indicate how far mathematicians still were from 
thinking set-theoretically. To get beyond Dirichlet’s class of functions, Hankel had 
to contemplate functions discontinuous at infinitely many points. 
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Second, functions that are not piecewise monotonic on an interval. These are 
functions that are not monotonic on a finite number of intervals. Hankel referred to 
them as functions with infinitely many maxima and minima, which is a reasonable 
way to think of them, and he gave the examples of the function which is sin 1/x 
when x 4 0 and some value, say 0, when x = 0, and the function x sin 1/x. 

Because Hankel’s main theorem is incorrect it will be convenient to follow it 
slowly and carefully, but it is helpful to know where he was going. His aim was 
to divide highly discontinuous functions into two classes and show that one class 
was integrable and the other was not. He used Riemann’s approach to the integral, 
according to which the crucial property of an integrable function is not continuity but 
the wildness of its oscillation. So his division was into functions that do not oscillate 
too much, and those that do, and that led him to consider the set of points at which a 
function oscillated a lot: was this set, so to speak, small (my term) in which case the 
function would be integrable, or large (my term again) in which case the function 
would not be integrable. Such, at least, was his hope. 

What Hankel meant by small and large sets is captured by his terminology. He 
said that a set A was scattered in an interval J if for all pairs aj,az € A,a, < a2 
there is a subinterval J of J such that J C (a,, a2) and JM A = @. Such sets are 
small. This corresponds to the modern concept of a nowhere dense set (the interior 
of the closure of the set is empty). He said that a set A filled an interval J if every 
subinterval J C J meets A, so JM A # VW. Such sets are large. This corresponds to 
the modern concept of a dense set. Hankel seems to have thought that a subset of an 
interval was either nowhere dense or dense—this is incorrect, but it is evidence of 
just how hard these concepts were to master when they were being introduced. 

Hankel now looked at the class of what he called linear discontinuous functions 
(linear here means defined on an interval). These are functions defined on an interval 
I with the property that they are discontinuous at infinitely many points. He divided 
this class into two subclasses. The first, which he called pointwise discontinuous, 
consists of the functions f for which the set S(o) is scattered in J for all o > 0. 
The second class, which he called the totally discontinuous functions, consists of 
functions for which there is ao > 0 such that S(o) fills at least a subinterval of J. 

Hankel introduced a further concept, which I shall call the total length of a cov- 
ering. Let |/| denote the length of an interval so if J = (a, b) then |J| = b—a. Ifa 
set A is covered by a finite set {J;} of intervals, then the total length of the covering 
is Ly = >¢; |{c|. The crucial quantity is the least upper bound of all finite coverings 
as the number of intervals in the covering increases arbitrarily. Hankel was most 
interested in the sets for which this quantity is zero; I shall call them zero coverable. 
There modern name is (temporarily) withheld to make the history stand out more 
clearly. 

We can now indicate the thrust of Hankel’s argument. He believed that a linear 
discontinuous function was either pointwise or totally discontinuous and (Riemann) 
integrable in the first case and not (Riemann) integrable) in the second. This is very 
tidy, and it is in fact correct that a function totally discontinuous on an interval 
is not (Riemann) integrable. But the other way round is false: it is not true that a 
pointwise discontinuous function is (Riemann) integrable. This amounts to saying 
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that a nowhere dense set is zero coverable, and that, as we shall see, is not true. The 
most likely reason for his mistake is an impoverished idea of what nowhere dense 
sets can be like, and historians have noted that every nowhere dense set in his paper 
was like a typical one of his examples of such a set, the set {ar t= 1, 23 ce): 


Du Bois-Reymond 


Du Bois-Reymond was another mathematician who thought that the class of Riemann- 
integrable functions was the largest subclass of the ‘assumptionless’ functions worth 
discussing, and that Riemann’s definition of integrable was as broad as possible. He 
is worth mentioning here for another attractive family of sets he introduced in a paper 
of 1875. These are the zero sets of sin(1/x), sin(1/sin(1/x)),... are they indeed 
sets of type 1, 2,...,,.... Unfortunately, du Bois-Reymond seems to have believed 
that these sets, which are nowhere dense in any interval, are the only possible sets 
of type n for some appropriate n. If so, he would have believed that all nowhere 
dense sets are zero coverable. Certainly, nowhere in his work did he find fault with 
Hankel’s account. 


Henry Smith 


It seems that the first mathematician to sort this out was the Oxford mathematician 
Henry Smith. He produced two (or, as you will see, perhaps we should say three) 
interesting examples of families of sets for the purposes of integration theory. The 
first family was constructed this way. The set P; consists of all the rational numbers 
of the form 1/n , for positive integers n,. The set P) consists of all the rational 
numbers of the form 1/n, + 1/n2 for positive integers n; and 12, and so on, so the 
set P; consists of all the rational numbers of the form 1/n; + 1/n2+...+ 1/nx 
for positive integers n|,2,...nx. It is a nice exercise to confirm Smith’s result that 


the jth derived set of Py, k > j is py? = {0} U P,;...U Py_;. Smith showed that a 
function which is discontinuous on a set Px is integrable, because these sets are zero 
coverable. 
The second family you may recognise under another name. Smith’s term for 
nowhere dense sets was sets ‘in loose order’ .! 
Let m be any given [integer] greater than 2. Divide the interval from 0 to | into m equal parts; 
and exempt the last segment from any subsequent division. Divide each of the remaining 
m— | segments into m equal parts; and exempt the last segment of each from any subsequent 


division. If this operation be continued ad infinitum, we shall obtain an infinite number of 
points of division P upon the line from 0 to 1. These points lie in loose order .... 


To understand this, look at what Smith defined that produced a nested family of 
sets. At stage zero, so to speak, we have the unit interval [0, 1]. At stage one, we 
have m — | equal subintervals that together form the interval [0, nd ]; the half-open 
interval (2 , |] at the end has been excluded (““exempted”, as Smith put it). At stage 
two, we have (m — 1)? intervals, and so on. 

We get a good account if we let m = 3 and label points in the interval 


[0,1] by ternary decimals. A ternary decimal is an expression of the form 


'See Smith (1875, 147), quoted in Hawkins (1975, 38). 
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0.10221002011102..., where the entries are either 0, 1, or 2, and the corresponding 
real number is written in powers of (1/3). So the above ternary decimal is equal to 


Lys OAS eo 1 es ay ay? ee 


We can now see that Smith’s process with m = 3 first of all drops all the ternary 
decimals that begin 0 - 2..., because they occupy the end interval, then it drops all 
the ternary decimals that begin 0-02... or0-12..., because these ternary decimals 
occupy the final thirds of the intervals [0, 1/3] and [1/3, 2/3] respectively. 

The set P that Smith is interested in is the intersection of the sets obtained at each 
stage, and we can see that it will consist of all ternary decimals that never contain a 2. 

The set thus obtained is the first ‘Cantor’ set, S. Itis nowhere dense, its complement 
at the nth stage is of length (1 — 1/m)", which tends to 1, so the set is zero-coverable 
and a bounded function discontinuous at precisely the points of S is integrable. 

Smith then modified this family (or produced a new family, if you prefer) which 
is still nowhere dense but not zero-coverable, so a function discontinuous on this 
set is not integrable, and therefore nowhere dense does not imply zero coverable. 
This he did as follows. Again, let m be any integer greater than 2, divide the interval 
from 0 to | into m equal parts, and exempt the last segment from any subsequent 
division. But now divide each of the remaining m — 1 segments into m? equal parts; 
and exempt (i.e. exclude) the last segment of each from any subsequent division. If 
this operation be continued ad infinitum, the points that have never been excluded 
form an infinite number of “points of division” Q in the interval [0, 1]. This set Q is 
nowhere dense. But now at stage n the complement of the set is length (1 — 1/m)(1— 
1/m7)...(1 — 1/m") which does not tend to zero as n tends to infinity but instead 
tends to the positive limit IT?°(1 — 1/m”). For example, if m = 3 this limit is over 
one-half (at about 0.560126). So the set Q is not zero coverable. 

The name “Cantor set’ is derived from the very similar set that Cantor described 
in his (1883). This set is obtained by following Smith’s construction with m = 3 
but excluding the middle third each time. In terms of ternary decimals, the resulting 
‘Cantor set’ consists of all ternary decimals that never contain a 1. 

This illustrates the sad truth that Smith’s works were not much read on the Conti- 
nent. So important is the set Q, however, that it really deserves a name, and because 
the name ‘Cantor set’ is surely permanent in the literature, the set Q will here be 
called a Smith—Cantor set. Its merit is that it is a nowhere dense set in the unit interval 
that is not zero coverable. 

So we see that there was a confusion, and not just in Hankel’s work, between a 
set being nowhere dense and its being zero coverable. The confusion may have come 
from the fact that Cantor’s sets of type n are nowhere dense and are zero coverable. 
This may well have suggested that the right way to single out point sets that are 
negligible for the purposes of integration—the zero coverable sets—is to select the 
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nowhere dense sets.” This entirely topological criterion was not, however, the right 
way to go, as the Smith—Cantor set indicated. 

Some side remarks with which to conclude. Smith, in this paper, also disproved 
Hankel’s theorem that if a function f is Riemann-integrable then all its one-sided 
limits f(x—) and f(x+) exist. This was also disproved by Johannes Thomae in 
1875, who gave this example: f(x) = sin(1/sin(/x)), which is integrable, but 
the discontinuities (which are at x = 1/m) are such that f(x+) and f(x—) do 
not exist. Thomae also failed to find fault with Hankel’s identification of piecewise 
discontinuous functions with the Riemann-integrable ones. 

Here’s a digression that is somehow relevant: The Taylor series of P(x) = (1 — 
x)(1—x)...(1—x")... begins 


12 15 4 422 4 726 _ 35 _ 404 51 4 57 4 (60), 


jax —27 +37 4%! =x 
The exponents are the pentagonal numbers 5n(3n+ 1),n=0,—-1,+1, —2,+2,.... 
Verify Euler’s result that 


[o,e) [oe 


[Ja —x/)= > (—1)4 x2 Gat D/2, 


j=l j=-o0 


Euler discovered this result in 1741 and published it in 1751, in E 158, and also in 
his Introductio (1748) Chapter XVI, § 323; see also Weil (1984, 279). The series is 
intimately involved in the study of the number of ways a given number can be written 
as the sum of the numbers 1, 2, 3, ...m. You may check that, for example, 7 may be 
written as the sum of numbers less than or equal to 7in 15 ways, as Euler observed 
in his Introductio § 324. 


23.4 Beyond Uniformity 


Once the concept of uniform continuity was elucidated, it was clear both that the 
uniform convergence of a sequence or series of continuous functions ensured that 
the limit function was continuous, and that the uniform convergence of a sequence 
or series of integrable functions ensured that the limit function was integrable and 
also that term by term integration was valid. 

We have already seen that the non-uniform convergence of a sequence or series 
of continuous functions is not enough to ensure that the limit function is continuous. 
We therefore turn to considering if a non-uniform limit of integrable functions has 
any useful properties. 

Let us fix some notation and some ideas. Suppose that (f;), kK = 0,1,... 
is a sequence of integrable functions on an interval [a, b], and define F,(x) = 


21 can now withdraw the term ‘zero coverable’ in favour of the later term ‘measure zero’. 
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a Sx (x). Let the series F;,, tend pointwise to a limit function F(x). We can now 
do two things. 

The first is to ask if this limit function F(x) is integrable. If it is, we naturally 
write the integral as [* F(t)dt. We mean by this [* (limp—oo Fy(t)) dt. 

Another thing we can do is consider the integrals [ E F,(t)dt (certainly the func- 
tion F,, is integrable, being a finite sum of integrable functions). This is term-by-term 
integration of the series. Then we consider the limit as n increases of these integrals, 
if it exists. If it does, we write it as limy—o iM Fy (t)dt). 

Now we can ask if these limits are the same. In symbols, does I i (limy-so0 Fy (t)) 
dt equal limy—oo (f;° Fn(t)dt)? 

The answer to all these questions is yes when the convergence is uniform, so the 
question is: what happens when it is not? 


A Non-integrable Limit of Integrable Functions 


First of all, it is entirely possible that a limit of Riemann-integrable functions is not 
Riemann-integrable. To obtain a sequence of (Riemann)-integrable functions tending 
to a non-integrable limit, take an ordering on the rationals between 0 and 1, and for 
each integer n enclose the first n rationals, 71,...,7,..-n, in the open intervals 
Ign = (1R- n> (2, re + n 7/2), k = 1...n of length +. Denote the union of 
these intervals by S,, so S, := Ue Ik.n. Let Xn be the characteristic function on 
the complement of S,, in [0, 1]. Trivially, x, is integrable for each n. 

Let S$ := limy_+ oo S,. Note that this is not Wek Sn, because a new set is acquired 
at each stage. In fact S = {x |dm : x € S,Vn > mt}. 

In fact, S = Q. Certainly r, € S, and indeed r, € S, for allm >n,soQc S. 
Let ¢ € [0, 1] — Q. Then ¢ € S implies that there is an m such that ¢ € S,, for all 
n>m,andso¢é € pei Ik.n for alln > m. If this holds for a constant value of k 
then the intersection is contained in any arbitrarily small interval around r;, which 
is to say the intersection is rz, but this is a contradiction because ¢ is irrational. If 
on the other hand ¢ € Up Ik.n for alln > m for at least two distinct values of k 
then ¢ € Ip.n 1 I,» for all n > m, but this is impossible because that intersection is 
empty. So S = Q, as claimed. 

It follows that the sequence of characteristic functions x, is a sequence of inte- 
grable functions tending to the characteristic function on the set [0, 1] — Q, and this 
is obviously not Riemann-integrable. 


Darboux’s Example of the Failure of Term by Term Integration 


Darboux, in his fascinating Mémoire of 1875, showed that a uniformly convergent 

series of integrable functions is integrable and can be integrated term by term. He 

also gave this example of a series that converges, but not uniformly, to an integrable 

(indeed, a continuous) function, for which the result of term by term integration of 

the series does not agree with the integral of the function defined by the series. 
This is the series s(x) = 7, Un whose nth term is 


22, 29 
Un = —2n2 xe x ya 2(n + 1)27xe7@tD x . 
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Fig. 23.1 The functions 
U3, Ug, U39 in Darboux’s 261 
sequence 
104 
_ \ _ 
4 “0.5 i 05. 1 15 2 
-1 
20) 
3 2): 
If we write f,(x) = —2n?xe—"*) then uy (x) = fn(x) — fa41(x) and so 
N 
sy(x) = > un (x) = file) — fnsi(). 
n=1 


Figure 23.1 shows the 3rd, 9th, and 30th terms of the sequence of functions 
involved (they get narrower and higher). The sharp, moving peak suggests that the 
convergence will not be uniform. 

Figure 23.2 shows 


8500(«) — S1o0(x) = fio) — fso1 (x), 


the difference between the sum of the first 500 and the sum of the first 100 terms, so 
it gives an impression of the tail after the first 100 terms that define the series. Notice 
how narrow it is, and how big. 

To prove Darboux’s claim, we note that the series s(x) converges to the function 
fi) = —2xe-*’, which can be integrated from 0 to x and the integral is er 1, 
Term by term integration of the series s(x) over the same interval, however, gives 
the sequence defined by 
2. 


Un(x) = (eo * = )= (et DPX? -D= gee _ eo nt lyPx? 


which has partial sums 
2 22 
ew — e N+) x 
2 es 
and so the sum of the term by term integrals in the limit is e~* . This is because the 


convergence is non-uniform, as Darboux explicitly noted, the tail 3°) un (N) being 
2N/e. 
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Fig. 23.2. The tail in Darboux’s limit function 


23.5 Weierstrass 


Shortly after becoming a professor of mathematics in Berlin, Weierstrass taught a 
course on the theory and applications of trigonometric series and definite integrals, 
related to the problem of representing arbitrary single-valued functions of one real 
variable in Fourier series and integrals. Many years later, in a letter to Schwarz on 
March 14th, 1885, he wrote that “the lack of rigour” he found in all the relevant 
works he had at hand, and his “at that time unfruitful efforts to supply this lack” 
convinced him not to teach that course again. 

Weierstrass was among the few to be dissatisfied with Riemann’s definition of 
the definite integral. He expressed his concern about that, as well as his attempts at 
extending Riemann’s definition, in letters to his friends and students. Writing to Paul 
du Bois-Reymond he lamented he had always complained that Riemann’s definition 
had a limitation which he had to work around because he did not know how to put 
right. 

It has always seemed to me that Riemann’s definition of the definite integral is subject to a 

disadvantage which I had to ignore since till now I did not know how to remove it. It is this: 

in determining the greatest oscillation that the value of the function to be integrated makes 

in an interval (therefore in the statement of the integrability condition), those values it has 

at the points of discontinuity are considered while, however, the value of the integral, when 


it exists, depends solely on the values which the function takes at the points of continuity. 
(1923, 215). 


On April 20th, 1885 he wrote that the definition in question should be modified 
“more energetically than he himself had done”. On 16th May 1885 he wrote Sophie 
Kovalevskaya that “Riemann’s definition, which is believed to be the most general 
one could think of, is insufficient”.+ Less than ten days later, on May 25th he repeated 
this statement word for word to Schwarz, adding that “it is neither general enough nor, 
above all, admissible. Perhaps it must be replaced by another, very different one, for 


3The letter was reprinted in Acta Mathematica, vol. 39 (1923), 199-225, quote from 215, and is 
taken from Hawkins (1975, 67). 


‘In Acta Mathematica, vol. 39 (1923), p. 215. 
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whose foundations Cantor’s newest researches offered to me essential services’’. This 
was connected with the problem of the representation of functions by Fourier series, 
and with the concluding remarks of Dirichlet’s 1829 paper in particular. Weierstrass 
had begun to study this subject thoroughly in early 1885, as he announced in a 
letter to Schwarz on March 14th, 1885. He made it the object of a paper (Weierstrass 
1885), and eventually he expounded it in his summer 1886 lectures in Berlin. There he 
presented his approximation theorem and his critical remarks on Riemann’s definition 
(Weierstrass 1886, 110-112). In spite of all his efforts, however, Weierstrass was 
unable to succeed in finding the sought-for extension. 


Chapter 24 
The Fundamental Theorem of the Calculus 


24.1 Introduction 


It seems that our intuition, or at least the intuitions of 19th Century mathematicians, 
was that a function is naturally differentiable. It is differentiable, they felt, unless it 
is obviously not, perhaps because it has a sudden change of direction or because it 
is not even continuous, but such events would only happen at isolated points. 

Insofar as this was their belief they were totally mistaken, and once they had 
sufficiently precise definitions they were gradually able to discover their error, but 
this was a process of discovery that had to be made many times, and led on the 
way to interesting discoveries of suggestive compromises. What, for example, of the 
fundamental theorem of the calculus? Could it be that if a function has a derivative 
then the derivative is necessarily integrable (and its integral differed from the original 
function by only a constant)? Or might there be differentiable functions with non- 
integrable derivatives? 


24.2 Bernhard Bolzano 


The first person to see clearly that continuity and differentiability had very little to 
do with each other (beyond the simple fact that differentiability implies integrability) 
was the obscure figure of Bernhard Bolzano. He became much better known in the 
20th Century than he ever was in his own day (1781-1848). For most of his adult life 
he was banned from teaching—he was an ordained priest of the Catholic Church and 
a professor of theology at the University of Prague—on the grounds of heresy. This 
meant that his voluminous philosophical writings were by and large not published. 
Some of his mathematical work, which interests us here, was published, but only 
obscurely and it only created a stir when it was re-read after its salient novelties had 
been rediscovered independently by others. Much remains to be done in rediscovering 


© Springer International Publishing Switzerland 2015 239 
J. Gray, The Real and the Complex: A History of Analysis in the 19th Century, 
Springer Undergraduate Mathematics Series, DOI 10.1007/978-3-3 19-237 15-2_24 


240 24 The Fundamental Theorem of the Calculus 


Bolzano’s significance for the history of mathematics, but an auspicious start has 
been made in the 21st Century with the publication of The Mathematical Works of 
Bernhard Bolzano (Russ 2004). 

Bolzano is a fascinating figure, driven, it would seem by a philosopher’s desire for 
clarity to scrutinise the foundations of the calculus and, more importantly, to rework 
them. It was not original in the 1820s to find fault with those foundations, and in 
particular with the way that intuitive geometric considerations were smuggled into 
proofs about infinitesimals and limits where one might suppose that intuition was 
blind. It was highly original to find a way to resolve this problem. 

Bolzano was adamant that geometric considerations should not enter proofs of 
theorems in analysis, most notably the fundamental theorem of algebra. This is 
evident both in the title of his crucial work, the Rein analytischer Beweis, etc. (or, 
A purely analytic proof, etc.)—and in the methods Bolzano used. Indeed one of the 
things Bolzano meant by analytic was non-geometric. Purity of method, as present- 
day philosophers of mathematics call the insistence on the use of only the most 
appropriate methods, was a deep concern of Bolzano when he was younger. But, as 
Russ shows, he seems to have felt that it would earn him few readers and he largely 
gave up the activity for a number of years. Historians of mathematics have had a 
tendency to talk him up as a precursor of those who rigorised the calculus by means 
of a sophisticated limit concept and an emphasis on the arithmetic character of the 
key definitions; Russ argues that this is overdone. Instead he suggests that Bolzano’s 
proof of the binomial theorem may be better appreciated for its way of dealing with 
finite variable quantities and not for its account of something like the modern limit 
concept. Even more importantly, as Russ shows, Bolzano’s definition of what it is 
for a function to be continuous at a point in the Rein analytischer Beweis, etc. seems 
to be the modern one, and therefore clearer than Cauchy’s definition a little later, 
who only considered continuity on an interval. 

Cauchy, as we have seen, was clear about limits but not clear by later standards 
about the real numbers. Bolzano certainly had and used the concept of what are 
called Cauchy sequences today. He claimed that if a sequence of quantities u,,, say, 
has the property that the absolute value of the difference uy, — uy+,- can be made 
arbitrarily small for all values of r by taking n large enough, then the sequence 
converges to a limit. Did he too fail to see that this required a precise theory of the 
real numbers? Russ suggests that the insight that sequences with this ‘bunching up’ 
property are defining a number is a profound insight into the nature of number that 
Bolzano was perhaps the first to express. This marks, in fact, his contribution to the 
Bolzano—Weierstrass theorem. 

What excited mathematicians in the 1930s (!) about Bolzano’s work was his 
description of a continuous, nowhere differentiable function. What a remarkable 
thing to look for, and what a delicate microscope is needed to exhibit it. His example 
rested on the observation that it is always possible to put a crinkle into the graph in 
any interval, no matter how small. Start with, say, the function f(x) = 1—2|x—1/2| 
on the interval [0, 1]. Replace each sloping segment with a zig-zag in three parts, 
the first and last of which are steeper than the segment they replace and which are 
joined by a short segment of the opposite slope and passing through the midpoint of 
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the segment they are replacing. Do this indefinitely, and there is every chance that 
the graph you end up with is that of a function which is continuous in the interval 
[0, 1] but nowhere differentiable—but that requires some care to establish! 


24.3 Non-differentiable Functions 


The recognition by mainstream mathematicians that continuity does not imply differ- 
entiability seems to have occurred to Riemann and Weierstrass separately by 1861. 
Hankel took Riemann’s strange function r(x) and defined 


F(x) = fa. 
0 n=1 


He showed that this function is continuous but fails to have a derivative at a dense 
set of points. 


Hankel 


Riemann and Weierstrass confined their examples to lectures, but Hankel went into 
print. He observed that the technique for constructing a singular point of a function 
at one point can be made to yield functions singular at infinitely many. This he called 
the method of condensation of singularities. Once you see the trick, it’s very easy. 
In his example, he supposed he had a function, y(x), whose values were bounded 
on the interval [—1, 1] by +1 and which was infinitely differentiable on the interval 
[—1, 1] except at the origin—you might take the example g(x) = x sin(1/x). The 
origin is called a singularity because the function is not differentiable there. Hankel 
then constructed a function in which the single singularity of y(x) so impregnated 
a function f(x) that it had infinitely many singular points in every interval (Hankel 
1882, 78). This was the function 


fx) By le 


n=1 


where s > 3—the factor n~* is just there to ensure convergence. As Hankel checked, 
this function is singular at every rational point in the interval [—1, 1]. 

The next graph, Fig. 24.1, is of the function g(x) = x sin (4). 

More generally, suppose the function y(x) has a singularity at the origin (you may 
continue to think of g(x) = x sin(1/x)). Let {a,} be your favourite infinite discrete 
set of points, and consider the sum >", (¢(x —a,). If this sum makes sense the function 
it defines has singular points at every a,. How could the sum fail to make sense? Only 
by being infinite, so let us insist that the original function y(x) is never greater than 
1 in absolute value, and consider this sum instead: >”, §G—%) you may use any 
other fudge factor that ensures convergence. This ensures pointwise convergence, 
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Fig. 24.1. g(x) = x sin (+) a 
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Fig. 24.2 Hankel 
condensation on [—1, 1] 
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Fig. 24.3 Hankel 
condensation closer up 1.18 
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and it remains now to check what is surely plausible that the new summation has not 
somehow smoothed out the singularities. 

If you take for y(x) the singular function g(x) = x sin(1/x) then the function 
> zg (sin jmx) should have lots of singularities. This is what the next two graphs 
suggest. Figure 24.2 is a graph of the function on [—1, 1], Fig. 24.3 of the function 
on [0.3, 0.4]. 


Condensing Singularities 


The next two graphs show the effect of taking the function g(x) = x sin (+) and 
applying the method of condensing singularities to it at the points i. Figure 24.4 
shows the first 4 and Fig. 24.5 the first 10 terms of the sum 
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Fig. 24.4 The first 4 points 
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Fig. 24.5 The first 10 points : 
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Hankel’s Example 


Hankel gave this example: 


f(x) = Do g(sinnax)/n', 5 > 2, 


n=1 


where the function g is continuous on [—1, 1] but not differentiable at x = 0, and he 
claimed that the function f(x) is continuous but not differentiable at all the rational 
points. 

What follows are some graphs of Hankel’s function with g(x) = x sin (4). The 
first, Fig. 24.6, just hints at how the singularities appear. 

Evidently something non-differentiable is happening at x = 0 and x = 1/2. 
Figures 24.7 and 24.8 show this in closer detail. 

But weird things are also going on elsewhere, near x = 3/5 and near x = 3/4, 
as Figs. 24.9 and 24.10 show in closer detail. 


Fig. 24.6 A Hankel function 
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Fig. 24.7 A Hankel function near x = 1/2 
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Fig. 24.9 A Hankel function near x = 3/5 
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Fig. 24.10 A Hankel function near x = 3/4 


Schwarz 


Hermann Amandus Schwarz (1843-1921) 


Hankel’s argument’s were not always correct, and they were criticised, but Schwarz 
then came forward with a solid example (Schwarz 1878). He obtained it from the 
function g(x) = |x| + (x — [x])!/2, where |x] is the floor function (as we say 
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today)! by using the method of condensation of singularities to define the function 


00 (gn 
ais > ee 
n=0 


which fails to be differentiable at the dense set of points m/2”. What is striking 
about this example is that the function is strictly increasing. It is possible to think 
that functions which fail to be differentiable do so at points where they oscillate up 
and down too much, but this is not the case. (It resembles the view that a function 
might fail to be differentiable if its derivative ‘should’ be 0, but not otherwise, which 
is implausible as soon as you say it.) 

The next three graphs show the Ist, 4th and 30th stages in Schwarz’s construction 
of function. To my eyes the nature of the limit function is very hard to see (Figs. 24.11, 
24.12 and 24.13). 


Fig. 24.11 The Ist Schwarz 3 
function ae 
2 

15 

1 

0.5 


Fig. 24.12 The 4th Schwarz 
function 5 


'The value of the floor function at x is the greatest integer less than or equal to x. 
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Fig. 24.13 The 30th 6 
Schwarz function 
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A Non-integrable Function with a Bounded Derivative 
Brodén’s Function 


Another good example is obtained from the function g(x) = x!/3 whichis monotonic 
increasing and has a graph that is vertical at the origin. In 1896 Torsten Brodén 
(following in the steps of both Cantor and Weierstrass) asked his readers to let {w,} 
be a dense set of points in [—1, 1], and {c,,} a set of positive numbers, and to define 
the function 


F(X) = Uneng(x — wn). 


This function is strictly increasing, so it has an inverse h which is also strictly in- 
creasing even though its derivative h’ vanishes on a dense set of points (the values of 
f (wn)). So the function h’ is a bounded function that is not Riemann integrable on 
its domain (which is f({—1, 1])). This shows that the fundamental theorem of the 
calculus cannot always be true. 

Animpression of the function f is given by Fig. 24.14, which fits just seven vertical 
tangents into the interval [—1, 1], butit gives an impression of what the finished object 
would be like with its infinitely many vertical tangents densely packed. 


Fig. 24.14 A first 
approximation to Brodén’s 
function 4, 
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Weierstrass 


Weierstrass enters the story here in a number of ways. As we noted above, he had 
heard of Riemann’s ideas about continuous nowhere differentiable functions through 
the grapevine, and had deduced from the existence of complex functions with the 
unit circle as a natural boundary that there are real-valued functions of a real variable 
that are continuous everywhere and differentiable nowhere. 

He also had a student, Leo Koenigsberger, who was a good enough mathematician 
to have known better than to suggest (as he did in his (1874, 13)) that a function would 
be differentiable if it had only finitely many maxima and minima and did not have 
jump discontinuities on a dense set of points (this last condition being intended to rule 
out Riemann’s example). Evidently he did not know of Schwarz’s example, which 
shows that this claim must be wrong. 

Koenigsberger said a copy of his book to his former professor, and Weierstrass 
replied, offering his own counter-example.” 

Weierstrass first distinguished carefully between what it is for a function to have 
some property at every point on its domain, and what is for a function to have that 
property at some point of any interval in its domain. Generally, it is easy to find 
functions of the second kind, but it can be much harder to find functions of the 
first kind. He then took the function g(x) = (1/2)x sin(log(x)), which fails to be 
differentiable only at the origin. He then chose ak < 1 and two sequences of real 
numbers {a,} and {b,},n = 0,1, ..., the first of which is arbitrary and the second 
of which is such that ~h b, is finite. Then he defined 


f (x) = bog(x) + >) bn(g(e = an) + g(% + an). 


n=1 


This function, he indicated, is continuous everywhere, monotonic (so it has a finite 
number of maxima and minima—zero in each case) but it fails to be differentiable 
at the set of points {a,}. Weierstrass observed that the set {a,,} can be taken to be the 
dense set of all algebraic numbers, which Cantor had recently proved was countable. 
He also speculated on whether there could be a nowhere differentiable monotonic 
function with bounded difference quotients, fern ft) and thought that they would 
be were hard to find. Much later Lebesgue showed that in fact they cannot exist. 


>The letter was published in volume 39 of Acta Mathematica, which Mittag-Leffler devoted to the 
life and work of Weierstrass in 1923, and which carried a number of historically valuable sources. 
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Ulisse Dini 


Ulisse Dini (1845-1918) 


The Italian mathematician Ulisse Dini studied mathematics in Pisa before going to 
Paris for the year 1865. On his return he began to doubt the truth of some of the 
propositions that he had heard concerning continuity and differentiability, which the 
publications of Hankel, Schwarz, Darboux and Weierstrass confirmed. He undertook 
further investigations of his own, which he published in his widely read Fondamenti 
per la teorica delle funzioni di variabili reali (or, Foundations of the theory of func- 
tions of real variables) (1878). 

His study rested on four useful and elementary terms, defined as follows. He 
considered the fraction Lath -£@) with 0 < h < b —-x, and let G, and L, be 
respectively the greatest lower bound and the least upper bounds of this fraction in 
the interval stated; they are functions of x and b. Then Dini defined 


Dy. f (x) = limp), Gy = lim inf (feat) 


v h 

D* f(x) :=limp,. Ly = lim sup (fee) 
hyo h 

D_ f(x) :=limpjx Gx = lim inf (atte) 

D™ f(x) :=limp,, Ly = lim sup (e--") 
hilo = 


where the limits are from above (A > 0). Note that lim inf and lim sup are later ideas, 
Dini gave only the first half of these definitions. 
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Dini applied these ideas to the fundamental theorem of the calculus. Cauchy had 
shown 
(1) that for a continuous function f on an interval, 


x / 


{iste 


a 


and also 
(2) that if F is a function with a continuous derivative then 


b 


[FP =Fo Fa. 


a 


However, Hankel had shown that the first of these claims fails for Riemann’s strange 
function on a dense set of points. 


x 
Dini was able to show that if f is integrable and F(x) = { f then 
a 


b 


F@)— F(a)= | DF, 


a 


where D F is any of the four of Dini’s derivatives. So (1) is true in the sense that F (x) 
has four derivatives, and they differ from one another by functions whose integral 
over any interval vanished. This does not mean that they differ by a nothing, merely 
by a function whose integral is negligible. The set of points at which such functions 
could be non-zero was still unclear in Dini’s time. 

As for (2), Dini was able to show that if F is a continuous function and one of its 
four Dini derivatives is integrable (denote it DF) then 


b 


[oF =Fo-Fa. 


a 
In short, the familiar fundamental theorem of the calculus holds for a differentiable 


function F if its four Dini derivatives agree, and if they do not something can still 
often be said. 
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24.4 Exercises 


1. How would you answer somebody (in the 1840s or 1850s) who said that it makes 
no sense to speak of a function being continuous at a point and that continuity 
has to mean something like continuous on an arbitrarily small interval? 

2. How different do you take differentiable and merely continuous functions to be? 

3. In the light of the differences between differentiable and merely continuous func- 
tions, and of suitable examples drawn from his work, what do you suppose 
Cauchy’s concept of continuity might have been? (Be prepared to offer several 
tentative and different opinions). 


Chapter 25 
The Construction of the Real Numbers 


25.1 Introduction 


Much of early 19th-century analysis rested uneasily on an intuitive notion of quantity 
that embraced all the measurable objects for which the natural numbers were inad- 
equate, in particular lengths. Real analysis was the study of real varying quantities, 
and at least informally it was compatible with infinite and infinitesimal magnitudes. 
Cauchy had redefined these infinite and infinitesimal magnitudes in terms of limits, 
but it was still unclear what the domain of real quantities comprised, and it struck 
several German mathematicians that complications in real analysis might be allevi- 
ated by providing a good definition of what the real numbers might be. Here we look 
at Weierstrass’s not entirely successful account, then at Dedekind’s more elegant 
version, and finally and very briefly at the theories of Cantor and Heine. 


25.2 Weierstrass’s Ideas 


One concept that disappeared from analysis under Weierstrass’s influence was that 
of the infinitesimal. It was shut out by his creation in the 1860s of a concept of real 
number, and the identification of ordinary magnitudes with real numbers. This meant 
that the concept of a magnitude could be replaced by that of a real number, and the 
concept of a variable magnitude could be replaced by that of a real variable. In the 
event, Weierstrass’s way of doing this was not adopted, and other ways of achieving 
this goal were preferred, but Weierstrass did identify the main features of the problem 
and its solution. He was clear that a system of ‘numbers’ must be created that exactly 
captured the intuitive properties of finite magnitudes, and that they must be created 
out of rational numbers, whose existence and properties were taken to be more or less 
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unproblematic. That is to say, positive whole numbers were collections of identical 
thought-objects, positive rational numbers were finite collections of units and aliquot 
parts (1/n where n is a integer). Irrational numbers were made up as infinite sums 
of these rational numbers, or rather, as appropriate equivalence classes of them. 


The Bolzano—Weierstrass Theorem 


The details of this construction need not concern us. Of more interest is this major 
achievement in Weierstrass’s creation of a theory of real numbers: his proof that every 
bounded infinite set of real numbers has a limit point. This result (called today the 
Bolzano—Weierstrass Theorem) encapsulates a key difference between Weierstrass’s 
ideas and those of Cauchy. Cauchy was vague on this point, out of a belief that nature 
provided such a limit point. Its existence was somehow built in to the concept of 
magnitude. Weierstrass, who had constructed the real numbers out of more primitive 
concepts, knew that he had to prove that his creation had this fundamental property, 
and that is what he did. As Cantor (1882, 566) and Jourdain (1915, 14) pointed out, 
Weierstrass was the first to avoid assuming that the limits exist that are taken to define 
real numbers. 

One may ask if Weierstrass was creating a class of objects where previously there 
had been none (and only acomforting illusion) or if he thought of himself as providing 
an improved definition of what was present in the world. On the first account he 
would say there had simply not been a foundation for the calculus. On the second 
account there had been foundations which he was now making more precise and more 
amenable to the new level of mathematical precision. It seems that Weierstrass’s 
answer would have been the second, with the proviso that it was only the new, 
mathematically precise system of numbers that could enter into proofs. Mathematics 
was no longer connected to intuition at the level of continuous magnitudes, but at 
the more basic level of the rational numbers, or even at the level below that, of the 
positive integers. 

Weierstrass’s ideas provoked quite some debate in Berlin. As early as 1870 
Kronecker was moving towards a position critical of all talk of real numbers. 
Schwarz wrote to Cantor that Kronecker regarded the Bolzano—Weierstrass prin- 
ciple as an “obvious sophism” and expected to be able to produce functions “that 
were so unreasonable that, despite satisfying all of Weierstrass’s assumptions, they 
would have no upper bounds”. Schwarz and Cantor, then at the start of their careers, 
defended their mentor Weierstrass’s views as indispensable for analysis, and indeed if 
Kronecker ever did seek such functions he failed to find any.! 


'See Ferreirés (1999, 37). 
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25.3 Dedekind’s Construction of the Real Numbers 


Weierstrass’s clumsy construction was soon replaced by two others, one due to 
Dedekind, the other to Cantor. Dedekind came to his in 1858, and wrote about it 
in a letter to Durége, a friend of his from his time in Gottingen, but only published it 
in 1872 when he learned that he would otherwise lose priority in the matter. He had 
been sent a memoir by Heine on the same subject and felt that although he agreed 
with the substance of the memoir his own presentation seemed to be simpler. And 
even as he wrote his own memoir up, he was sent a paper by Cantor that also captured 
what he, Dedekind, saw as the essence of continuity, although he was less persuaded 
of the higher realms of abstract numbers that Cantor conjured up. 

Dedekind’s construction is his famous idea of what are today called Dedekind 
cuts. He had already noted that, given an arbitrary unit of length, every rational 
number can be associated with a unique point on a line, but the converse is false: 
there are lengths that are not measured by any rational multiple of the unit length. A 
cut, as he called it, is a division of the rational numbers into two disjoint sets such 
that every rational number in one set is less than every rational number in the other 
set. It is convenient to call two such sets L for ‘left’ and R for ‘right’, with every / in 
L less than any r in R. Dedekind showed how to consider cuts as numbers, calling 
the necessary work time-consuming but not difficult. 

He distinguished between cuts for which there is a rational number gq, say, in L 
such that / < q <r for all/ in L and all r in R (or, similarly, for a qg in R) and 
those for which there is no such rational number. The former kind of cut he identified 
with the rational number q, while irrational numbers he defined as cuts of the second 
kind. The fact that there are cuts of the second kind was the reason, he said, for the 
incompleteness or discontinuity of the rational numbers. For example V2 is not a 
rational number, but it is defined by the cut which assigns the rational number g to 
L if and only if either g is negative or g? < 2 and assigns the rational number gq to 
R if and only if g is non-negative and q* > 2. This cut is of the second kind. 

Each cut is taken to define a real number, and Dedekind then showed that if one 
attempts to define cuts in the set of real numbers, the cuts are always of the first kind, 
so no new numbers can be obtained by iterating the construction.” 


Beside these properties, however, the domain R possesses also continuity; i.e., the following 
theorem is true: 


IV. If the system R of all real numbers breaks up into two classes A;, Az such that every 
number a of the class Ay is less than every number a of the class Az then there exists one 
and only one number a@ by which this separation is produced. 


Proof. By the separation or the cut of R into A; and A» we obtain at the same time a cut 
(A1, Az) of the system R of all rational numbers which is defined by this: that A; contains all 
rational numbers of the class A; and A9 all other rational numbers, i.e., all rational numbers 
of the class Az. Let a be the perfectly definite number which produces this cut (Aj, A2). If 
B is any number different from @, there are always infinitely many rational numbers c lying 
between a and f. If 8 < a, then c < a; hence c belongs to the class A; and consequently 


2See Dedekind C ‘ontinuity and irrational numbers, 20-21. 
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also to the class A, and since at the same time f < c then £ also belongs to the same class 
A, because every number in Az is greater than every number c in A,. But if 6 > a, then is 
c > a; hence c belongs to the class Az and consequently also to the class Az, and since at 
the same time 6 > c, then also belongs to the same class Az because every number in A; 
is less than every number c in A. Hence every number f different from a belongs to the 
class A, or to the class Ay according as B < a or B < a; consequently a itself is either the 
greatest number in A, or the least number in Ag, i.e. a is one and obviously the only number 
by which the separation of R into the classes A;, Az is produced. Which was to be proved. 


Dedekind insisted that the real numbers he had defined were creations of the 
human intellect, and he went on to insist that the same was true of the rational numbers 
and even the integers. On this basis he spelled out, more clearly than Weierstrass 
had done, what the relationship was between the real numbers he had defined and 
magnitudes along a line. Dedekind saw that if real numbers and magnitudes were in 
a one-to-one correspondence, then it was necessary that whenever a line was divided 
into exactly two sets, such that every member of one set lay to the left of every 
member of the second set, there was exactly one point that marked the separation of 
the sets. 

Dedekind did remarkably little with his cuts. He showed how to add them, and 
how they could be used to make limiting arguments rigorous, but the rest of the work 
(showing how to subtract, multiply and divide them) which is not entirely trivial 
when the cuts define negative numbers, he never wrote down. Maybe it was trivial 
for him. 


25.4 The Ideas of Heine and Cantor 


What of the accounts of the real numbers that Heine and Cantor published, which 
Dedekind thought were inferior to his own? Heine observed that many of Weier- 
strass’s as yet unpublished results would remain dubious until the nature of the real 
numbers was clarified. Heine then gave his account, which began with a purely for- 
mal account of the integers and the rational numbers, and then defined real numbers 
as (equivalence classes of) certain sequences of rational numbers. He then showed 
that repeating this construction did not lead to yet more types of number. Then he 
used his newly-defined real numbers to establish that a continuous function on a 
closed bounded set is uniformly continuous. 

Because they became better known, let us look in a little more detail at Cantor’s 
ideas. They grew out of his study of the possible bad point sets of a Fourier or 
trigonometric series. Once one was aware that these series can misbehave on some 
strange looking sets indeed, it becomes natural to want to know about the mother set, 
the set of all real numbers. So Cantor gave his own definition of the real numbers in 
a paper on trigonometric series of 1872. 

He considered what we (but not he) would call Cauchy sequences, sequences of 
rational numbers {a,} such that the difference |a,4 — d,| becomes, as he put it, 
infinitely small with increasing n. He said that such a sequence had a definite limit, 
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and he then set out the arithmetic of such limits, after noting that another sequence 
a;, may be said to have the same limit if the sequence {ay — a/,} has the limit 0, when 
he said the sequence became infinitely small. So Cantor studied limits where today 
we might study equivalence classes of Cauchy sequences, and it is these limits that 
are to be the real numbers. 

First he defined an order relation on limits, and showed that if a and b are two 
limits than exactly one of the following is true: 


b=a, b>a, b<a 


Then he defined the operations of additions, subtraction, multiplication and divi- 
sion on limits in terms of the sequences defining the limits, noting rather cryptically 
that division will be mildly troublesome for sequences aj, for which some aj, = 0. 

Then he considered what would happen if, from the set of all limits, which he 
denoted B, one repeated the construction and formed the set C consisting of, to use 
our language, equivalence classes of Cauchy sequences of elements of B. Indeed 
he then considered repeating the construction indefinitely often. He noted that the 
new sets would be formally distinct, but showed that there was an order-preserving 
1-1 correspondence between the sets B and C, and so that in that sense the chain of 
constructions stopped after the first stage. 


25.5 Other Theories 


Cantor’s theory almost identical with his colleague Heine’s in Halle, as indeed was 
that of the French mathematician Charles Méray, which, unknown to the German 
mathematicians, he had published in 1869. The Franco-Prussian war may well have 
played some part in this lack of communication. Their approach may be contrasted 
with that offered by two other mathematicians, Hermann Hankel and Johannes 
Thomae. Hankel gave a purely formal account of the integers and the rational num- 
bers (which, indeed, Heine took over) but he despaired of giving a reductive account 
of irrational numbers, because he felt there were simply too many ways in which 
they could be created. Instead, he was content to leave the concept of a irrational 
number rooted in the intuition of magnitude. 

Thomae, another student of Weierstrass’s and also at Halle in the 1870s, published 
abook on the elementary theory of complex functions in 1880, the year after he arrived 
in Jena. Here, and again at greater length in the book’s second edition, he married 
Weierstrass’s theory of rational numbers with the Cantor-Heine theory of the real 
numbers. However, and perhaps without realising just what it could mean to have 
Gottlob Frege as a colleague, Thomae then gave a philosophical interpretation of the 
new, extended, number concept, and provoked one of the many furious squabbles 
that Frege enjoyed. That is another story, but briefly Thomae labelled all numbers 
other than the natural numbers signs, and regarded them as being entirely free of 
content. Their conditions of existence were simply that they had been obtained by 
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abstraction from the rules of arithmetic. He spelled out this position at greater length 
in the second edition of his book, published in 1898. Arithmetic, he said was a game 
played with empty signs according to certain rules. The same, he observed, was 
true of chess, but whereas the rules of chess are arbitrary, the rules of arithmetic 
are such that numbers can be described by simple axioms in an intuitive way which 
permits their use in the study of nature. The formal point of view, he went on to 
claim, dissolves all metaphysical difficulties. Frege—of course—disagreed. Here, at 
all events, is one of the moments when the idea that mathematics might be formally 
very like a game of chess entered the discussion. 


Chapter 26 
Implicit Functions 


26.1 Introduction 


The short history of the implicit function theorem displays a number of characteristic 
features of the history of mathematics in the period we are considering. For many 
years the result was taken for granted, and attention paid to the task of finding explic- 
itly functions defined only implicitly. Then it was realised that there was something 
to prove, but the task was assigned to complex analysis—which says something 
about the state of real variable theory. Only much later was the theorem proved in 
the context of real analysis. 


26.2 Power Series and Implicit Functions 


Almost from the time Descartes published his La Géométrie in 1637 mathematicians 
felt able to pass from functions defined implicitly in the form f(x, y) = Oto functions 
defined explicitly in the form y = g(x), although they allowed that the explicit form 
might be correct for a limited range of the independent variable x and that there might 
be a few values of x where this cannot be done. Their confidence, if it was examined 
at all, rested on the appearance of the graphs of implicitly defined functions. 

The simple case of the circle x? + y* = 1 makes the case. At all points other 
than the extremities on the x-axis there are two branches of the inverse function y = 
+./1 — x2 and it is easy to choose one, but this breaks down when the circle curves 
back on itself, which it does at the points where the tangent is vertical. This strongly 
suggests that in general an implicit function f(x, y) = 0 admits an expression as an 
explicit function y = g(x) in any range of points x where there points (x, y) such 
that f(x, y) = 0 and of # 0, because 


of af 
O0=df = —dx+ dy, 
f Ox Oy af 
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and so 
dy __ Of ,of 
dx Ox’ Oy’ 
and the tangent is vertical when of is Zero. 


As so often in the 18th century, mathematicians posed the problem not of the very 
existence of an explicit function but of finding its specific expression so that it could 
be put to use. This was also the case with inverting functions: given a function in 
the form y = g(x) when is there a function x = h(y)? In this case, the obvious 
condition to insist upon is that the graph of the function does not have a horizontal 
tangent. And because the most general expression for a function was taken to be an 
infinite series 

y=gotgixtgox?+--, 


the question was phrased as finding the inverse function 
x=hothiyt+hoy?+--, 


where the coefficients 4; will depend in some way on the coefficients g;. In this 
setting the problem was sometimes called the ‘reversion’ of series. Thus in his (1770) 
Lagrange was concerned with expanding the roots of a general polynomial equation 
as power series. He reduced the question to that of expressing p, a root of the equation 


a-x+y(x) =0, 


as a function of x, and he gave his answer as a power series expansion for p in terms 
of x that involved the derivatives of y and the subsequent setting of a to zero. This 
much was formal algebra; his deeper concern was the convergence of the infinite 
series. 

Some months later he tackled the same question in the context of celestial mechan- 
ics. Kepler’s equation (Lagrange 1771), 


t=x-—csinx, 


involves two known quantities ¢ and c, and the solution is to be given as x as a power 
series int with c as a parameter. Lagrange solved the problem geometrically, contrary 
to his usual practice, but in 1780 Laplace solved it algebraically, and Lagrange then 
used the result himself. After the formal work, the issue was again one of convergence 
of the series for the inverse function, for which, Lagrange observed, c had to be small. 
The problem was a difficult one, and Laplace returned to it again in 1825, which 
eventually caught the attention of Cauchy. 

AS we saw in Chap. 6 the late 1820s were a period when Cauchy was interested 
in complex variables and complex functions, and so he cast the problem of the 
reversion of series in that form and applied his residue calculus to the discussion of 
the convergence of the series of the inverse function. Once complex function theory 
was established, a full generation later, it was clear that the convergence of a power 
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series in a real variable is determined by what happens on its maximal circle of 
convergence, which may well be determined by what happens for complex values of 
the variable (as is the case with (1 + x”)~!). But that was not Cauchy’s idea, and he 
used his residue calculus only to handle the formal side of estimating convergence. 

Indeed, the story of the implicit and inverse function question so far illustrates the 
widespread belief among mathematicians that formal methods, that accommodate 
both real and complex values for variables, would validly take you a long way, 
although of course some further understanding of the convergence of series would 
be required, especially in applications. This approach had a lot to recommend it, 
but it was an obstacle to the clear understanding of what complex differentiability 
means and its implications for the representation of functions, and it meant that 
crucial distinctions between the real and the complex implicit function theorem were 
obscured. 


26.3 Cauchy 


When, then, was the question of the existence of an implicit function first addressed? 
Unsurprisingly, the answer is by Cauchy, not in these papers but, if only obliquely, 
in the lectures on differential equations that he gave at the Ecole Polytechnique in 
1820. Here he investigated when a differential equation of the form 


d 
~ = f(x,y) (26.1) 
»@ 


has a solution that took a particular value yo at a particular point x9. After dealing with 
many examples, in Lectures 7 and 8 he showed that if the function f is continuous 
and bounded as a function of x and y on the interval x9 < x < X then there is a 
function y of x that satisfies the differential equation on some interval of the form 
(xo, X9 + A) and for which y(xo) = yo.! 

His argument was that the solution to the Eq. (26.1) is very close to the collection 
of points (x;, y;) given by 


Yj — jar = Oy — Xj-1) fF j-1, Yj-1) 


when the points x; are close together. This is what you would expect, because when 
Xj; — Xj;—1 is small the quotient 

Yj ~ Vir! 

Xj — Xj-1 


should be very close to the value of ay and so the chord joining (x;-1, yj—1) to 
(x;, yj) is very close to the graph of the sought-for solution of the differential equation 


'To control what happens when the choice of initial value yo is varied Cauchy added the condition 
that the partial derivative of is also continuous and bounded as a function of x and y on the interval 


xo <x < X, but we need not pursue that here. 
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at the point (x;-1, yj-1). Then, as the points x; get closer and closer together, the 
conditions on the function f(x, y) are sufficient to ensure convergence, the function 
y is represented by the integral of f(x, y)dx, and the theorem is proved. 

Why is this connected to the idea of implicit and explicit functions? Consider the 
implicit function defined by 


gx, y) =x? +y*?-1=0, 


for which 


" X an a y 


It is elementary that the differential equation 


dy x 


dx y 
has g(x, y) as the unique solution that passes through the point (x, y) = (0, 0). But 
this gives the solution in implicit form; what Cauchy did, however, was to show that 
this equation has a solution in the form y = h(x), thus exhibiting the solution as an 
explicit function of x. 

Ten years later, Cauchy was the first person to tackle the question directly of 
when an implicitly defined function f(x, y) = 0 can be written locally as an explicit 
function y = g(x), which he did in the memoir he wrote in Turin in 1831 Fa 

In what follows we consider an equation f(x, y) = O defined on a suitable 
domain that contains the point (a, b) for which f(a, b) = 0, and look for an explicit 
function y = g(x) such that f(x, g(x)) = 0 and g(a) = b. We shall also show that 
under suitable conditions the explicit function g(x) is an analytic function of x on 
its domain. 

Cauchy’s proof is far from clear, and it will be helpful to suppose that (a, b) = 
(0, 0). On the assumption that y = 0 is a root of f(0, y) = 0 of multiplicity 1, the 
absolute values of all other roots of the equation f (0, y) = O are greater than a certain 
amount, which can be denoted Z. Cauchy aimed to show that under these conditions 
and for each fixed value of x in a certain interval, the equations f(x, y) = 0 and 
f(, y) = O have the same number of roots—one in each case—and so y is a 
single-valued function of x on that interval. 

Here is a free translation of what he wrote. I have imposed the simplification 
b=0. 


Cauchy’s (1831) 


Now let y be an implicit function of x determined by an equation of the form 
f(x, y) = 0, and b = 0 be a value of y that corresponds to a particular value 


>This account follows the version that he published in 1841. See also Bottazzini and Gray (2013, 
150-155). 


26.3 Cauchy 263 


of x. If one sets y = b+ z =z, we have f(x, z) = 0. This done, denote by fy (x, y) 
the [partial] derivative of f with respect to y. Suppose moreover that 


the equation f (0, z) = 0 has a unique real or imaginary root z such that |z| < Z 
for some Z, 

the function f (x, z) has a unique and definite value for every real or complex value 
of z such that |z| < Z, 

for such a value of z the function F(y) = F(z) never becomes discontinuous or 
infinite. 


Then, on writing 
Z= Zell! 


for a real arc q one will have the equation 


1 (* .fy@,zZ) 


Qn Jn f@.D 


F(y)= F(Z)dq. 


So it remains to deduce this result from [the Cauchy integral theorem, established at 
the start of the memoir], which one does as follows. 
When z = Z both terms of the binomial 


Sy, Z) ~ F(z) 
F 
faa ey 


become infinite; but the binomial itself generally has a finite value, to wit 


1 F(z) 
F(z) + 5 FAC aC 


Therefore, if the modulus Z of z is chosen in such a way that the above statements 
are satisfied, namely 


1. the equation f(x, z) = 0 has a unique root z such that |z| < Z, 
2. F(y) = F(z) never becomes discontinuous or infinite for any z such that |z| < Z, 


then the product 


(fed. Fe 
:( BRS r@ 25) 


will be a continuous function of Z for the stated value of Z or any smaller value. 
Now, by the earlier result 


M eT) aw o.- - " zdq 
GD F(Z)dq = F(z) a 


3The opaque notation is Cauchy’s. 
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= 2nF(Z) = 2nF(y), 


which is the formula we want. If one sets F(y) = | this gives 


dt Phas, 
mt Fed q=l. (26.2) 


If, on the contrary, one sets F(y) = y one concludes that 


at ee 
=m | OF HM 


that is, 7 
a 1 T fy, 2) 

y= — Z rs 

2a Jin =f (x, 2) 


(26.3) 


[End of the free translation. ] 

Cauchy then proved two theorems that allowed him to infer that the equations 
f(x, z) = 0 and f(0,z) = 0 have the same number of zeros for all x less than a 
certain amount. In the case when this number is | it follows that for each x in a 
suitable interval around x = 0 there is a unique z for each x, and so the equation 
Ff (x, z) = 0 defines an implicit function z = g(x). 

Here is the second of Cauchy’s theorems in the case when we are dealing with a 
unique root. 

Theorem IV If the equation f (0, z) = 0 has a unique root of modulus less than 
Z and moreover 


1. for all x such that |x| < X [for some non-zero real X] and z such that |z| < Z 
the function f(x, z) always has a unique and definite value, 

2. forx such that |x| < Z the function f (x, z)/f (0, Z) is developable as a convergent 
series in ascending powers of x for all x such that |x| < X [i.e. the function is 
complex analytic, as Cauchy would later realise]. 


Then for such an x the equation f(x, z) = 0 has a unique root of modulus less 
than Z. 
It will help to write 
f y(x, Z) 
f@D 


— h(x, Z). 


Cauchy’s insight was to expand h(x, Z) as a power series in X: 


~ ~ du, 4 duy 
h(x, z) = h(O, Z) +x—— + x° —= + ---, 
dz dz 
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where the seemingly artificial expressions for the coefficients of the powers of x 
are obtained are the derivatives of h(x, Z) at x = O and so are expressions in the 
derivatives of f(x, Zz). But because uv; and wz vanish, the result is that 


[meas | h(O, Z), 


But the expression on the left counts the number of zeros of h(x, Z) inside |x| < X, 
so it is is equal the number on the left, and by Eq. (26.3) this is 1. 

A further use of the Cauchy representation theorem, using the special cases above 
(F(y) = 1, F(y) = y), allowed Cauchy to deduce that this root is expressible as a 
convergent power series in x. This uses nothing more than the fact that the function 
h(x, Z) is analytic, which follows from the fact that f(x, y) is, in ways Cauchy did 
not specify, well-behaved (say, in later terms, analytic in x and y). 

Cauchy went on to prove the theorem that is usually called Rouché’s theorem in 
complex analysis today, which says that if the analytic functions f and g are defined 
on an open disc and at all points z on the boundary of the disc we have 


If) — 9g@I < IF OI, 


then the number of zeros and f and g in the open disc are the same when counted 
with multiplicities. Rouché gave this result in his (1862), noting as he did so that his 
presentation was more precise than anything that could be found in Cauchy’s papers. 
Cauchy’s argument was indeed messily presented, but the theorem is nonetheless 
present in Cauchy (1831), see his Theorem VII, p. 93. 

But what had Cauchy proved? One entirely correct modern answer (see e.g. 
(Krantz and Parks 2002, 27—31)) is that he had proved the implicit function the- 
orem for holomorphic functions—although not entirely rigorously or convincingly. 
But it is noticeable here, as it is throughout his work in the 1830s, that he had not 
appreciated the difference between real and complex differentiability. We can make 
his theorem precise by tying it to the holomorphic domain, but at the time Cauchy 
had no idea what the extent of that domain was. 

The first person to be clear about the implicit function in the holomorphic context 
from start to finish was Weierstrass, in his lectures in Berlin in 1861. They survive 
in unpublished notes by Schwarz* and other writers, such as Pincherle, who was an 
enthusiast for Weierstrassian methods. He remarked> (Pincherle 1922, 209) that the 
theorem 

was first proved by Cauchy, then a little later by Weierstrass with more precision and without 


recourse to curvilinear integrals. The method I am about to give, that proceeds more fluently 
and efficiently, is a mixed method. 


Pincherle considered a function F(x, y) = O that was regular and analytic in both 
variables when x is in a domain A and y in a domain B, and for which F(x, y) 


4Mentioned without further discussion in Bottazzini and Gray (2013, 378). 
>He used almost the same words in the edition of his lecture notes at Bologna in 1901. 
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does not vanish. He let (a, b) be a point with a € A and b € B and sought to define 
a function y = a(x) such that a(a) = b and F(x, a(x) = 0 and a is regular and 
analytic for all x € A. 

He first observed that it is enough to prove the theorem when a = 0 = b. He 
expanded the function F(x, y) as a power series 


F(x, y) = yo(y) + f@, y), 
where y(y) is given by a power series that does not vanish when y = 0: 
y(y) =cotciy+cay? ++. 


Clearly, f (0, 0) = 0. He took logarithms of both sides and obtained 


= f(x, y) 
log F(x, y) = log y + log y(y) + log {1+ 
yey) 
which he wrote as 
log F(x, y) = log y4 toe (1 ay © 2 -) log (1 fey), (26.4) 
co C] yey) 


Pincherle now chose a number p such that |y| < p; implies |y(y)| > A = co/2, 
and another number /2 such that |y| < p2 implies oe + oy? +.---| <q <1,and 
a value for p’ strictly less than p, the minimum of p; and p2. Then, in the annulus 
p' <\y| < pone has |yyp(y)| > pA. 

Then he chose a number r such that |x| < r implies that | f(x, y)| < p’ Ay, for all 
y in the annulus and for some jz < 1. Under these conditions (|x| <r, p’ < |y| < p) 


the term log (1 + f oe 


in negative and positive powers of y in which the coefficients are analytic functions 
of x regular in the disc |x| <r. 


) can be written as a uniformly convergent Laurent series 


The term log (1 + o4y4 2 y? +:: -) is regular and analytic for |y| < p, and so 


co 
Eq. (26.4) can be written in the form 


[ee 


log F(x, y) = logy + logcg + >, aj(x)y!. 


j=-00 


Pincherle differentiated it with respect to y and obtained 


Fy (x, y) 1 = ‘ 
ye? : -l 
SS > aj(x)yl™. 
F(x, y) y wee 


J=-oo 
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He then remarked that because the second term on the right hand side is the 
derivative of a Laurent series it cannot contain a term in y—! and so the residue of the 
logarithmic derivative of F(x, y) is 1. But, by standard results in complex function 
theory that he had proved earlier, it follows that F(x, y) has a unique root of first 
order j, for each X such that |%| < r given by 


1 f yFyG@,y) 
2Qni F(x, y) e 
around any circle centre the origin of radius less than r. Moreover, this root, which 
is the coefficient of y~? in the term on the right hand side, can be written as } = 
—a_ (X) and is a regular analytic function on |x| < r. This proves the theorem. 

The improvement that Pincherle imported was the use of Laurent series, which 

Weierstrass avoided because the most natural approach to them relied on the theory 
of the Cauchy integral and Weierstrass always endeavoured in his lectures in Berlin 
in using that body of ideas. 


26.4 Dini 


Clarity about the distinction between the real and complex cases of the implicit 
function theorem only came with the work of Dini in the 1870s, specifically his 
Lezioni di analisi infinitesimale vol. 1 1877 (Ch XIII, pp. 197 et seq.). He also gave 
the first rigorous extension of the theorem to several variables. In fact, and as a result, 
the implicit function theorem is known in Italy as Dini’s theorem. 

We shall follow him only in the simplest case. Dini supposed there was a function 
Ff (x, y) = 0 defined when x and y were in some region C—we can suppose x and 
y lie in two intervals. Furthermore, xo and yo in the appropriate intervals are such 
that f(xo, yo) = 0. The function f(x, y) is finite and continuous on this region, it 
has both first partial derivatives on the region, and of is non-zero. He then set about 
showing that for any value of xo + h sufficiently close to xo there is a value yp of y 
such that f(x9 +h, y;,) = 0. The function y = y(h) will therefore be the required 
explicit function. 

He supposed that there were positive numbers io and ko such that |h| < ho, 
|k| < kg implies that the point (xo + h, yo + k) is in C. He wrote 


fOo+h, yt k) = fxo+h, yt k)— f(xo, yo tk) + f(x0, vo +k) — fo, Yo), 
using the fact that f(xo, yo) = 0, and so 
f (xo +h, yo +k) — f(xo, yo +k) = hfx (xo + Oh, yo +k) 


and 
f (xo, Yo + k) — f (Xo, yo) = kfy (Xo, yo) + a, 
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where 0 < 6 < | and ais a number that, when ko is sufficiently small but not zero, 
can be made arbitrarily small. Therefore 


f(xo +h, vo +k) = Afx (x0 + Oh, vo +k) + k(fy(X0, Yo) + Q). 


He then observed that because f} (xo, yo) is non-zero, and for ko and a sufficiently 
small, one can say that for all k < ko the quantity fy (xo, yo) + a will always have the 
same sign as fy(xo, yo) and is greater than zero by at least an amount d. Therefore 
kfy(xo, yo) + @ takes values greater in absolute value than kod as k goes from —ko 
to +ko. 

He then defined \ as the lim sup of all the absolute values of f;(x, y) for x and 
y in the region considered, and noted that 


|[f(xo +h, yo + k)| < hod. 


Because ho < kod, if need be by choosing a smaller value of ho, it follows that 
f(xo +h, yo +k) goes from negative to positive as k goes from —ko to ko, and so, 
because it is continuous, there must be at least one value of k in that range for which 
fo +h, yo+k) =0. 

From this it follows that for each value of h such that —ho < h < ho that there 
is a value of k such that f(xo +h, yo +k) = 0. Therefore, for each allowed value 
of x there is one or more function y that satisfies the equation f(x, y) = 0 and for 
which when x = x9 we have y = yo. So, provided f(x, y) is non-zero in some 
subregion of C containing (xco, y) there is exactly one value of k and therefore one 
function y. 

He concluded that 


In the interval (xp — ho, Xo + ho) there is a well-determined, single-valued, real function 
y = yo +k that can be regarded as a function implicitly defined by the given equation 
fQ@,y) =0. 


26.5 Conclusions 


The implicit function theorem has been little studied by historians of mathematics; 
even the richly informative book (Krantz and Parks 2002) does not make it entirely 
clear how mathematicians made their way to it in the first instance. But it illustrates 
rather tidily some of the themes of any history of analysis. There was a prolonged 
phase when the possibility was taken for granted and the emphasis was placed on 
doing it in an effective way. This came to involve power series and questions of 
convergence, and this shows just how confused people were about real and complex 
differentiability, even when, as Cauchy was, they were clear that there was something 
to prove. Only quite late, after a proof in the complex domain, was a proof given 
of an implicit function in the real domain for functions satisfying a modest set of 
natural conditions. 
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Of course, the theory did not stop with Dini’s work. There are versions of the 
theorem for several variables in the 19th century, and many developments in the 
20th, as Krantz and Parks describe—but these must be themes for later historians. 


Chapter 27 
Towards Lebesgue’s Theory of Integration 


27.1 Introduction 


This chapter looks briefly at the Lebesgue integral, to see what questions it answered 
and what it is good for. The first half takes up two problems in analysis before 1900, 
the second half looks at how they, and the problem of the fundamental theorem of 
the calculus, were resolved using Lebesgue’s theory. 

This chapter and the next two one are offered as pointers to some of the ways 
analysis was to develop in the 20th century, and to branch into new domains, abstract 
set theory and topology. 


27.2 Problems 


The Length of Curves 


Consider the question of the length of a curve. A curve that has a length is said to 
be rectifiable, meaning that it can be straightened out without stretching it and made 
to measure its length along a line. The question became: what curves have lengths? 
Indeed, does every curve have a length? 

The second question can be disposed of simply. The possibly surprising answer is 
that there are even simple closed curves for which the answer is ‘No’—for example, 
the von Koch snowflake curve. The curve was introduced by Helge von Koch in 
1904, and published more accessibly in his (1906), because he wanted to illustrate 
Weierstrass’s discovery of a continuous, nowhere differentiable, function with a 
geometrical, rather than an analytic, example. 

To obtain the curve, start with an equilateral triangle of side 1. The rule for 
generating each successive approximation is this: regard each line segment of length 
a as being in three equal parts; place an equilateral triangle of side a/3 on the interval 
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Fig. 27.1 Growing an edge (a) 


of the von Koch snowflake 
curve 


(b) 
(c) 


(d) 


in the middle and then remove the middle interval. The first few stages are shown in 
Fig. 27.1 as they affect just one side. 

And here is an impression of the curve (Fig.27.2). Now you can see why it has 
the name it does: 

Now, the original curve (the equilateral triangle) has a length of 3. At the first 
stage, each edge is replaced by a piece of the new curve of length 4/3, so the next 
figure after the triangle has length 3 x 4/3 = 4. Indeed, at every stage the length of 
the curve goes up by 4/3, so the length of the von Kochcurve can only be infinite, 
which is another way of saying it has a length. 
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Fig. 27.2. The von Koch 
snowflake 


But what is the right way to define the length of a curve? Suppose a curve is 
defined as y = f(x), x € [a, b]. Elementary calculus says that the length is given 
by the integral [ (1 + (f(xy ig '/ 1x. This is defended by considering a sequence 
of points (¢;, f (¢;)) on the curve and arguing that when #;, — t; = dx; and therefore 
S G41) — f (t)) = dy; the length of the interval joining successive points on the curve 
is a good approximation to the length of the curve between those two points, and so 
the length of the curve between a and b is well approximated by the sum of these 
quantities, and then proceeding to the limit as 7 increases indefinitely. Since each 
interval has length (dx? + dy?)'/" = (1 + £(xi)°) — dx the familiar calculus 
expression follows. 

Now it might be argued that the geometry is good, and the length of the curve 
is given as the limit of a sum of lengths of intervals, but that the introduction of 
the derivative f’ is spurious, and that a curve might have a length even though it is 
not differentiable. After some tentative explorations by du Bois-Reymond and Otto 
Stolz, Ludwig Scheefer took up the question. In his (1884) he claimed that there are 
indeed curves which have a length, but for which the derivative oscillates in every 
interval and so the integral has no meaning as a Riemann integral. 


Scheefer’s Function 


Scheefer’s example went as follows. Let (s,) and (w,) be two sequences of real 
numbers and suppose that each s, > 0 and Xs, converges. Define a function f as 


follows: : 
f(x) = > 
—oco 


where the summation is taken over all s, such that w, < x.The function is monotonic 
increasing, since 


f(wrt+) — f(wr—-) = f(wr+) — f(r) = Sr. 
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At each w, the function f has a jump discontinuity of s, as you can easily check. 
Now suppose that the w, form a dense subset of (a, b). The image set f([a, b]) is 
[Lf (a), f(b) — UF (wr), f wr). 


Scheefer defined an ‘inverse’ function g = f —! on[ f(@, f(b)] as follows: 


ayy = [0 ify elf@, FO) 
wr ify € (f(w,), f(wr+)] 

The function g is flat on (f(w,), f(w,r+1), so g’ = 0 in each of these intervals. 
The sum of the length of these intervals is Us, = f(b) — f(a) so f ([a, b]) is 
a discrete subset of [ f(a), f(b)], and so g’ is zero except on a discrete set. On the 
other hand, the length of the curve y = g(t) plainly is © ,s, = f(b) — f(a), so 
this is an example of a curve with a definite length that is not to be evaluated by the 
familiar integral. 

This is reminiscent of a function that defines what is sometimes called the ‘devil’s 
staircase’ and is described at the end of this chapter. 

A breakthrough in this subject, although it was not entirely appreciated by its 
author, had earlier been made by Camille Jordan in 1881 when he introduced the 
concept of functions of bounded variation. We say that a function f : [a,b] > Ris 
of bounded variation if for every partition of [a, x], saya = to <t) <-++ <t-1 < 
th < x, the sums D/(f (t;) — f(tj-1) and —X”(f (t;) — f (j—-1)), taken respectively 
over the increasing and decreasing segments, have finite least upper bounds for all 
values of x in (a, b]. Let these bounds be P(x) and N(x) respectively. Then 


f@)—f@ = UF G)- fG-1)) = U'(F@)— FG-)-(-2"(F@) — FG-D) 


and so, in the limit, 


fx) — f@ = PQ) — N(x) 


which shows that the function f is the difference of two nondecreasing functions. 
The converse is trivial, so Jordan had shown that a function is of bounded variation 
if and only if it is the difference of two nondecreasing functions. 

Jordan published this work in a note attached to the third volume of the first 
edition of his Cours d’analyse, (1887), and again in the first volume of the second 
edition, (1893), so it was widely read. He drew the corollary that a curve y = f(x) 
is rectifiable if and only if the function f is of bounded variation, and in 1893 
he also noted that if a curve (x(t), y(t)) is a rectifiable curve for which x’ and y’ 
exist and are continuous, and s(t) denotes the length of the curve from, say, fo to f, 
then s’(t) = (uy + y'(t)”) ? But curiously, he did not deduce that the classical 
integral formula therefore applied in this case. Plainly he thought it had been shown 
to be irrelevant to the study of lengths of curves. 
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27.3 The Area of Surfaces 


The area of a surface is seemingly evaluated by the calculus in a perfectly routine 
way. Suppose that the surface is given in vector form as r(u, v). Any elementary 
textbook in differential geometry will tell you that the area is obtained by integrating 


(EG — py where 
E:="r,%y,, F:=Yyly, Gi=7yPy. 


But is this the best definition? Why, indeed, is it true? 

The rationale for this definition is that the expression (E G-—F *) /? js the area of 
the image of an infinitesimal square in the (u, v) domain, and of course the area of the 
surface is found by adding up (i.e. integrating) these infinitesimal distorted squares 
over the surface in question. Now, the analogy with the question of the lengths of 
curves is fairly clear. On a curve one wants to attach rulers as successive points, 
approximate the length that way, and then steadily refine the approximation so that 
the limit is a good definition of the length of a curve. So, on a surface, why not attach 
lots of triangles and proceed in the same way? This would mean that at each stage 
the area of the surface is approximated by the sum of the areas of the triangles. 


Schwarz’s Paradox 


It was Schwarz who found a most disturbing paradox by looking at this naive idea 
carefully. It had been given in a textbook by the French mathematician Serret, so 
Schwarz communicated his observation to Hermite, who promptly put in the notes 
for his Cours d’analyse for 1881. Schwarz (1881-1882) considered a cylinder (you 
can almost hear the magician saying “Nothing tricky here, just an ordinary cylinder, 
nothing special.”’) and considered it as the image of a map from the rectangle [0, 277] x 
[0, h] that sends (u, v) to (r cosu, r sinu, v), where r and h are constants (the radius 
and height of the cylinder). This cylinder has, of course, the area 2a7rh. He then 
inscribed 4mn triangular faces on the cylinder as follows: the vertices are the images 
of these points on the rectangle (Fig. 27.3): 

C= *) ‘a +1)r *) (o +1)r Qv+ =“) 


’ 
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All these triangles are isosceles and congruent, and they each have area 


rsin(Z) 2 (1-0o0(2))°+ (4). 
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Fig. 27.3. Triangles on 
a cylinder 


So the area of the collection of triangles is 


4mnr sin (=) ae (1 — cos (=))" + e) 
= 4mnr sin (=) js sin4 (—) + ey (27.1) 


It is now desired to take the limit as the number of triangles increases, which happens 
as m and n tend to infinity. However, Schwarz observed, if one lets m = an this limit 
is 27rh, which is indeed the expected value (the dominant term in the square root is 
the xi). But if one sets n = am? then this limit is the limit of 


QnrV aren? + h? 
which depends on a and can therefore be arbitrarily large. Likewise, if n = am? the 


area is the limit of 
*m' sin (~~) sin? (5) 
8r-m* sin ( — } sin* ( — 
m 2m 


and again the limit can be arbitrarily large. 

The naive definition of area is therefore erroneous, and Schwarz had no suggestion 
as to how to put it right. Although Peano came along a few years later, as did Hermite, 
with suggestions for what could be done, their ideas were conspicuously patches and 
the damage remained done. It looked as if there was no satisfactory definition of the 
area of even the simplest curved surfaces.! 


'For a thorough account of the contributions of Peano and Lebesgue, see Gandon and Perrin (2009). 
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27.4 Lebesgue’s Answers 


The length of a curve was defined by Henri Lebesgue as the smallest limit of the 
lengths of polygonal lines tending uniformly to the curve. He then showed that if the 
curve is given parametrically as 


x=f®), y=o@), z=¥@, 


then when f, ¢, and ~ have derivatives, a necessary and sufficient condition for a 
curve to have a length is that the integral of J(f@? + b(t)? + W'(t)?) exists. 

Similarly, Lebesgue defined the area of a surface L as the least limit of the areas 
of polyhedral surfaces that tend uniformly to L. He then showed that in the special 
case where the surface has continuously varying tangent planes the area is indeed 
given by the formula [{ “EG — F?dudv. 

It took much longer for Lebesgue to resolve the problems with the funda- 
mental theorem of the calculus. Recall that this can be phrased in three different 
ways, concerning a function f(x) defined on an interval [a,b], and the function 


Fie _, f. 


e I: F is a primitive function for f, that is F’ = f 

e II: Every primitive function of f is of the form iF f+ec 

e III: (a result used by Cauchy to prove II): If G is such that G’(x) = 0 on [a, b] 
then G(x) is constant on [a, b]. 


Lebesgue could now prove the fundamental theorem of the calculus (II) for functions 
with bounded derivatives, which had been a problem before because of the existence 
of bounded derivatives that are not Riemann-integrable. First he was able to show 
that, on his definition of measurable sets, if the function f is the limit of a sequence 
of measurable functions (/f;,), then F is also measurable. Notice that this is not true 
of Riemann-integrable functions. Then he showed that if, in addition, the functions 
| fn(x)| are all bounded by anumber B, then ff = limn—oo f fn. Finally he applied 


these ideas to the functions Lath fe) as follows: 
b b xth 72 
: f(x +h) — f(x) . | 
| ug sim | A ae is ; f(b) — f(a) 


This still left the fundamental theorem of the calculus in the form ( is xy! = f(x). 

In 1902-1903 Lebesgue was invited to give the prestigious Cours Peccot Lectures 
at the Collége de France. He found the opportunity stimulating, and the published 
version, his Lecons sur l’intégration et la recherche des fonctions primitives (1904) 
catries a fine axiomatisation of the idea of integration. Lebesgue showed that there 
was essentially only one good definition of the integral, it satisfied these natural 
axioms, and his definitions exemplified them. The axioms are: 
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i b b+h 
[ fooar= f(x —h)dx; 
a ath 
2. 
b c a 
fr+f f+ f f =9; 
a b c 
3. 
b b b 
foto=fore fo 
4. i 
Iff(a) = Oand b >a then / f = 9; 
5; 
1 
[- 
0 
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b b 
If fa(x) t f(x) then [ pay f. 


Lebesgue could now show that if any of the four Dini derivatives Df of f are 
integrable then [” Df = f(b) — f(a. 

It is interesting to see that the Dirichlet function on [0, 1] that takes the value 1 
at rational points and 0 at irrational points is Lebesgue integrable by considering the 
function 

lim ( lim (cos(m!nx))™"). 
m—>wo \n->oo 
This example is due to Lebesgue himself. 

By now numerous people were responding to Lebesgue’s ideas, and finally in 
1907 Lebesgue came to the triumphant result that the fundamental theorem of the 
calculus in the form I can be proved, indeed for any integrable (but not necessarily 
continuous) function f: 

x / 
([ 1) =r 
a 
almost everywhere. 
That is to say, equality holds except on a set of measure zero, one negligible for the 
purposes of education. 


This is a good place to end what might seem like an extended advertisement for 
the Lebesgue integral. 
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Hausdorff’s Paradox 


The mathematician and philosopher Felix Hausdorff should, however, have the last 
word. Hausdorff was a radical in his approach to mathematics. He gave a lecture 
course on Time and Space in 1903-1904 in which he said that” 
Mathematics totally disregards the actual significance conveyed to its concepts, the actual 
validity that one can accord to its theorems. Its indefinable concepts are arbitrarily chosen 


objects of thought and its axioms are arbitrarily, albeit consistently, chosen relations among 
these objects. Mathematics is a science of pure thought, exactly like logic. 


Hausdorff’s interests in mathematics deepened after 1911. He was the first to 
take set theory up energetically since Cantor, and his major work, important for 
both set theory and topology, is his Grundziige der Mengenlehre (1914). Hausdorff 
published what became known as his paradox in the next year, 1915.° It states that on 
any definition of measure that satisfies Lebesgue’s original four axioms, there will be 
bounded sets that are not measurable. To derive a contradiction, Hausdorff assumed 
that every bounded set is measurable. He took the sphere in three-dimensional space, 
and divided it into four disjoint sets A, B, C, D with the property that A, B, C and 
B UC are all congruent, while D has measure zero. Because the sets A, B, and C 
are congruent, they have equal measure, and because they cover the sphere except 
for the set D of measure zero, each must have measure 1/3. But the sets A and BUC 
are also congruent and cover the sphere except for the set of measure zero, so they 
must have equal measure, which must therefore be 1/2. It follows that the set A has 
measure 1/2 and 1/3, which is impossible. Accordingly, the assumption that there is 
some definition of measure according to which every bounded set is measurable is 
false. 

How could mathematicians respond to this unsettling and paradoxical argument? 
They could agree that’s how it is—there will always be non-measurable sets, which is 
a limitation on the uses of measure theory that might prove worrying; they could reject 
the concept of measure and look for another concept altogether—but Lebesgue’s 
four axioms are very natural ones with many successes to their credit; or they could 
scrutinise the proof and hope to find a flaw. Borel took this third way out. Hausdorff 
had used the axiom of choice in his proof, and Borel replied that the paradox came 
about not because measure was an inherently flawed concept, but because the set A 
was not properly defined.* To construct a set using the axiom of choice was for Borel 
no construction at all. “If one scorns precision and logic”, he wrote, “one arrives at 
contradictions”.> Hausdorff, of course, was not persuaded. He was happy with the 
axiom of choice and simply not bothered that any definition of the area of a set is 
inherently imperfect—even though this was a conclusion that could never have been 
dreamt of by researchers a generation before. 


2See Corry (2006, 148). 
>Hausdorff (1915). 
4See the second edition of his Lecons sur la théorie des functions (1914). 


Borel (1914, 256), quoted in Moore (1982, 188), who also notes that a number of Italian mathemati- 
cians had already explicitly rejected the idea that one can make infinitely many arbitrary choices, 
Peano among them, see Moore (1982, 76-82). 
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The Staircase Function 


The Cantor function of Kolmogorov and Fomin is defined as follows. Recall that the 
Cantor set is obtained by deleting open intervals from J = [0, 1] in the following 
order: first delete J, = (1/3, 2/3). Then delete Jz = (1/9, 2/9) and J3 = (7/9, 8/9). 
At the nth stage, delete the 2” open intervals which, going from left to right can be 
called Jgn—-1, Jgn-14.4,.- +5 Jon-14gn-1_1 = Jon_1. Define a function f; on J~ which 
is att where k = 2” +r, O<r < 2” (soon J43 the function is 11/16). Consider 
the function f on UxJ; that agrees with f, on J;, and define the Cantor function 
to be the unique continuous function on the interval J that agrees with f on Ux Jk 
(which is of course, the complement of the Cantor set). 

This remarkable function is continuous, monotonic increasing, bounded, and 
indeed differentiable on [0, 1]. It takes every value from 0 to 1, but its derivative 
is zero everywhere except on a countable set of measure zero! 

Here’s another Cantor function. Write every number in [0, 1] as a ternary decimal, 
with a convention about numbers that end in ...22222222... which can also be 
written to end in ... 100000.... The function maps a number with no 1’s in its 
ternary expansion to itself, and a number of the form 0.aja2...az...., where ax is 
the first 1 to O.ajaz...a,00000.... 

Here are two graphs that give an impression of the staircase function 


Figure 27.4 shows the sum of the first 30 terms. 

Figure 27.5 shows the difference between the 20th and the 10th terms, showing 
how crinkly the additions are becoming. 

The function is monotone increasing. It is continuous at x if and only if x is 
irrational, and if x is irrational then the value st (x) is transcendental. 


Fig. 27.4 The sum of the 24 
staircase function to 30 terms 
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Fig. 27.5 The difference 
between the 20th and the 
10th terms of the staircase 0.01 | 
function 
0.008 | 
0.006 | ra 
0.004 | 
0.002 | 
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27.5 Exercises 


1. Do we need a theory of the real numbers if our intuitions are reliable? Are they 
reliable? 

2. What was meant by a domain of a function of two variables? Compare the likely 
answers of Riemann, Harnack, and Poincaré. 

3. What is good, and what is not so good, about the Riemann integral? 


Chapter 28 
Cantor, Set Theory, and Foundations 


Georg Cantor (1845-1918) 


28.1 Introduction 


Dedekind had ushered in the idea that mathematical concepts might profitably be 
defined in terms of a naive conception of sets. His friend Cantor extended this to a 
theory of sets of various sizes, and came to a number of paradoxical conclusions, 
such as the existence of a one-to-one correspondence between a line and a square, 
which threatened the fundamental idea of the dimension of these and other domains. 
He also discovered that infinite sets could have different sizes, as for example the set 
of rational numbers and the set of real numbers, but he was never able to establish his 
belief that the smallest uncountable set can be put in a one-to-one correspondence 
© Springer International Publishing Switzerland 2015 283 


J. Gray, The Real and the Complex: A History of Analysis in the 19th Century, 
Springer Undergraduate Mathematics Series, DOI 10.1007/978-3-3 19-237 15-2_28 


284 28 Cantor, Set Theory, and Foundations 


with the real numbers (the continuum hypothesis). Indeed, as we shall see, the whole 
edifice of set theory came to be threatened by a number of paradoxes that were put 
forward around the start of the 20th century. 


28.2 The Continuum 


For much of his life, Georg Cantor was interested in the continuum, a.k.a. the Reals. 
His problem was: how to characterise it among the collection of all sets. In this 
endeavour, he never succeeded, because there were substantial, indeed permanent 
obstacles in his way. Some problems were topological, others proved to belong in 
the uncharted but shifting domain of the foundations of mathematics, and while both 
are interesting we shall take only the foundational route here. 

The first question concerns the size of the continuum, and eventually it became 
known as the continuum hypothesis. The first landmark is Cantor’s discovery that 
infinite sets can actually have different sizes. Cantor came to this conclusion in 1873, 
when he showed that there are both countable and uncountable sets. His first proof 
is not so well known. It is not difficult to count the rational numbers, or even the 
algebraic numbers (the roots of polynomials with integer coefficients). To show that 
the real numbers are not countable, Cantor argued by contradiction. 

Suppose that the real numbers are denumerable, and list them: 


W1,W2,.--,Wn,--- 


Cantor now exhibited a real number not in this list, as follows. Choose a < (3, and 
let the first two numbers in the list that lie between a and ( be a’ and /3’. In the same 
way chose a” and 3”, and so on. Either you must stop after finitely many steps, or 
not. It you stop at, say, a, and J, it is easy to find two real numbers between a, 
and 3, but they cannot both lie in the list. If you do not stop, consider lim ay, = Ago 
and lim 8, = (oo. If these are equal, say to w, then w is not in the list. For, if it 
is, then w = w, for some n and then it would have been an a or a (2. If, on the 
other hand, lim a, < $9, then any real number between lim a, and Go produces a 
contradiction. 
Cantor’s second proof was the famous diagonal argument of 1891. 


28.3 The Line and the Square 


In 1877 Cantor wrote to Dedekind to describe a 1-1 correspondence between the 
points of the line and the plane. He was so struck by this that he switched from German 
to French to say ‘I see it, but I don’t believe it!’ (see Cavaillés and Noether 1937, 34). 
His argument was that one can take two infinite decimals, the coordinates of a point 
in the plane, and riffle shuffle them together to produce a single decimal. Equally, one 
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can take a single infinite decimal and take it apart, thus establishing the remarkable 
1-1 correspondence. Cantor believed his discovery undermined almost everything 
that had been said about geometry and the dimension of figures, but Dedekind almost 
spoiled the party by observing that the argument was flawed. There has to be a 
convention about decimals ending in a run of 9s, for they are equal to decimals 
ending an infinite run of Os. Pick one, say we allow all 9’s and disallow all Os. Consider 
now the numbers 0.1 1a20.. . a, 0an+10...and 0.410429... dn 9an419.... They are 
different, but they map to (0.a;a2a3..., 0.1000...) and (0.a,a2a3..., 0.0999...) 
respectively, and these are the same; Cantor’s map is not 1-1. Cantor agreed at once, 
but repaired the proof within two days, although the initial simplicity was lost in the 
process. Dedekind continued to observe, however, that the damage to mathematics 
was not as great as Cantor had feared (or hoped?)—Cantor’s map was very obviously 
not continuous. 


28.4 Transfinite Ordinals 


We saw in Sect.23.2 above that Cantor had studied nowhere dense sets on the real 
line. Transfinite ordinals grew out of this work, and the first transfinite symbols were 
indices before they acquired a new significance, and for that they had to wait a decade. 

In the work on Fourier series, Cantor had the idea of forming a sequence of sets, 
each one the derived set of the set before: 


Py PON PN cP ws 


and continuing this process ad infinitum, thus obtaining a set he denoted P‘~). But 
of course it is possible to start with this set, and form the sequence 


P! >) pl >+1) pl ee aL 


and so on, to form P‘?~?, and beyond. 

In his Grundlagen (1883) Cantor replaced the arrow with the Greek letter w, and 
regarded the indices as numbers. He realised he had the familiar sequence of finite 
ordinals 1,2,3,... (which are to be thought of as the first, second, third and so 
on rather than as one, two, three and so on) but that he had more. He had the first 
transfinite ordinal, w, and a succession of ordinals beyond it: 


wt+tl,w+2,...,2w,2w4+1,... 
He also saw that they possessed an arithmetic, in which, however, w+ 1 4 1+ w, 


for w+ 1 is {2,3,4,5,...}, {1} and 1 + w = {1}, {2,3,4,...} = w. So addition of 
these new numbers is defined, but it is not commutative. 
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Cantor now began to outline the corresponding theory of transfinite cardinals, 
which should measure the size of infinite sets. Here he was both remarkably success- 
ful and unsuccessful. He put his theory on the map, despite the indifference if not 
indeed dislike of some older mathematicians, it weathered the first storms it attracted 
when the set-theoretic paradoxes were discovered, and Hilbert hailed it as a Paradise 
from which mathematicians would never be banished. But in the absence of good 
definitions his ideas were at best intuitive, and very few of the important results 
in the subject were established by him. Yet ultimately it did become the proving 
ground for many discussions of the foundations of mathematics, and it remains so to 
this day. 


The Power Set Construction 


This comes from Cantor’s (1890). Compare 7 = (0, 1) and the set of all its subsets, 
F ={f : I — {0, 1}}. Suppose there is a 1-1 correspondence from J to F that to 
x € I assigns fy € F, and to f € F assigns x € J. Define ®(x, y) = f(y). 
We now deduce there is a 1-1 correspondence between the set of maps of the form 
®(x, —) and the set F’. The map is onto, because given f € F pick xf € J. Then 
O(xp,y) = fx, = f(y). The map is 1-1, for if it is not then there are x, x’ € J 
such that ®(x, y) = ®(x’, y) for all y € J, but this implies f,(y) = fy,(y) for all 
y € 1, whichimplies f; = f,,, which implies x = x’. On the other hand, every map 
of the form ®(x, —) satisfies the equation ®(x, x) = f(x), and there is a map g 
such that g(x) #4 B(x, x)—just let g take the other value. 

It follows that a set is strictly smaller than its power set, and so there is a chain 
of larger and larger sets. By way of notation, I shall write the power set of a set S 
either as g9 (S) or as 25. So one has S < 6£(S) ~ 9 (go(S)) ~ ---. This in turn opens 
the question of what is meant by the size of sets, which Cantor measured by what he 
called their ‘power’ and later became known as their cardinality. This is both a more 
elementary concept than the order type of a set and yet harder to define. We might, 
for example, consider the cardinality of a set to be the set of all sets with which it 
can be put in a 1-1 correspondence—but if we do we shall be disappointed, as we 
shall see. 


28.5 The Continuum Hypothesis 


We can now state the continuum hypothesis, as it came to be known (Cantor seems 
never to have called it this). The set R has a cardinality, call it c, which is greater 
than the cardinality of Q, the rational numbers, denoted &o, the smallest infinite 
cardinal. In symbols, 8p) < c. From the construction of the real numbers it is clear 
that R C g{Q}, where go{Q} is the power set of Q. So c < 2%0, and in fact these 
cardinals are equal. The continuum hypothesis asks for a comparison of this cardinal 
with the smallest cardinal larger than Xo, namely &1, and it claims that these are 
equal. If the claim is false then c = 2*° is some other aleph, and the question of 
which one would arise. It is one of the great achievements of recent mathematics 
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to show that on the usually accepted axiomatisation of set theory (ZF for Zermelo- 
Frankel), and even granting the axiom of choice (making set theory into ZFC), the 
continuum hypothesis is independent of the axioms and so there are set theories in 
which the continuum hypothesis is true and others in which it is false. 


28.6 The Paradoxes of Set Theory 


A setis said to be well-ordered (by us, Cantor’s definition was different but equivalent) 
if there is an ordering on it, often denoted <, with the properties that: 


1. given any two elements in the set, a and b, exactly one of the following holds: 
a<b,a=b,b<a; 
2. every non-empty set has a least element. 


Well-ordered sets have the same order type if there is a 1-1 order preserving cor- 
respondence between them. Ordinal numbers are the order types of well ordered 
sets. 

So you might ask if every set can be well-ordered. If you do, what, for example, 
might be a well ordering of the real numbers? That seems very hard, if not impossible 
to produce. There are other difficult questions about infinite sets. The continuum 
hypothesis has already been mentioned. Another is the trichotomy of cardinals, the 
idea that given any two cardinal numbers, either one is bigger than the other, or they 
are the same size. Even the axiom of choice, which is the claim that given a collection 
of sets there is a map which picks out an element of each set and forms of set of these 
elements, is not obvious. In fact, as already noted, in axiomatic set theory Choice is 
taken as an axiom. 


The Greatest Ordinal? 


On the other hand, consider, with the Italian mathematician Burali-Forti in the 1890s, 
the collection of all ordinal numbers. Ordinal numbers form a well-ordered set, so 
this collection has an ordinal, which must be larger than any ordinal in the collection, 
but that collection contained all the ordinals. A paradox, if not indeed an antinomy— 
a fatal logical flaw. Or, consider the collection of all cardinals. An exactly similar 
argument also produces a paradox. 

Now why are these paradoxes taken so seriously? One very reasonable answer 
is that they should not be taken any more seriously than any of the other problems 
just listed (the continuum hypothesis, the well-ordering principle, Choice). They 
show that a certain, new, topic in mathematics is not properly understood. That’s 
interesting, challenging, and quite typical of mathematics. Burali-Forti, for example, 
thought that his argument just showed that trichotomy for cardinals did not hold, a 
result he deduced from his paradox. 

Cantor was also not bothered. He had discovered these paradoxes for himself, 
but he had strong theological views about accessible infinities of various sizes and 
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the truly inaccessible infinities appropriate to talk about God, so the set-theoretic 
paradoxes were small oddities to him. 

Nor was it the case that nothing could be said. To be entirely ahistorical for 
a moment, it eventually became clear that starting from ZF and adding an axiom 
produced a great deal of unity: ZF + Choice is equivalent to ZF + well-ordering, 
and to ZF + trichotomy. This isn’t entirely good news. ZF + Choice strikes most 
people as reasonable in a way ZF + well-ordering does not. 


Russell’s Paradox 


In 1902 matters became much more serious. What are, or should be, the foundations 
of mathematics? Dedekind’s construction of the real numbers out of sets of rational 
numbers is exemplary, and when he also constructed the natural numbers out of 
mere sets (not an easy thing to do and not to be described here) the result was to 
show that the most fundamental objects in mathematics can all be thought of as sets. 
Mathematics no longer needed to be founded in real-world objects, which was just as 
well because it was plainly capable of describing situations where real-world intuition 
was completely inadequate. How then could mathematics be true? A mixed group 
of philosophers such as Bertrand Russell, mathematicians turned philosophers such 
as Frege, and also a number of distinguished mathematicians such as Poincaré (and 
Hilbert, Zermelo, and Hausdorff) became interested in this question. To some, but 
not all of this group, the answer lay in logic. The truths of logic seem indisputable, so 
if mathematics could be reduced to logic theorems in mathematics would be logically 
true. 

Set theory is not logic, so the task of reducing the one to the other is far from 
trivial, and by 1900 the front runner in this enterprise was Frege. As he was finishing 
the second volume of a work that was intended to show that the whole logicist 
enterprise, as this philosophy of mathematics is called, can be made to work, he 
received a postcard from Russell. It concerned the class of all predicates that cannot 
be predicated of themselves. In more usual mathematical language, let us say that a set 
contains itself if itis an element of itself: S € S. We define A := {S: S ¢ S}, and ask 
if A contains itself or not. By definition, Ac A=> A¢A,andA¢gA=>A€EA. 
Trouble either way! 

Russell’s paradox, known already in Gottingen to Ernst Zermelo and Hilbert, can 
be given a much more logical, less mathematical, presentation. Frege saw at once 
that it destroyed his life’s work, and he had the honesty to say so. Indeed, the damage 
cannot be put right. There is an irreducible step in any foundation of mathematics that 
goes beyond elementary logic. But the paradoxes, enjoyable though they are, only 
destroy the logicist approach to the foundations of mathematics. Other approaches, 
closer to naive set theory, have other problems but more life in them. That, of course, 
is another story. 

The only way round the paradoxes is to put limitations on what collections of 
objects can be sets. However, when mathematicians have attempted to characterise 
what sets are, say by a system of axioms guaranteeing their properties, it seems that 
the bounds of elementary logic are irreversibly crossed. 


Chapter 29 
Topology 


29.1 Introduction 


Here we look briefly at problems to do with curves and surfaces, lengths and areas, 
starting with the great question of: what is a continuous curve? In particular, does 
a simple closed curve on a sphere have a simply-connected interior and a simply- 
connected exterior? Then we return to the problem of distinguishing a the unit interval 
from the unit square, and we conclude by showing how curves can have areas (non- 
zero Lebesgue measure). In this way we shall pick up some of the roots of both 
point-set and algebraic or geometric topology. 


29.2 The Jordan Curve Theorem 


The Jordan curve theorem states that a simple closed curve divides the plane into two 
regions. It was refined in Schoenflies (1906) to the claim that one of these regions is 
topologically a disc and the other is a disc with a point removed. 

Jordan first stated the claim in his Cours d’Analyse (1887).! He had in mind the 
question that until the theorem is proved, what sense does it make to evaluate the 
Cauchy integral of a meromorphic function round a curve and count the contribution 
of the points inside? This is, of course, easy for simple contours (the disc, a semicircle, 
a rectangle, a half-plane) but by the late 1890s very general curves were being 
investigated from a number of standpoints: what then? 

How can the theorem be difficult? Let us start with the simplest cases. A triangle 
with three straight sides satisfies the theorem because we can orient the sides of the 
triangle in the same way (so that we may follow round the triangle in a way that 


'See vol. 3, 587-594. It does not seem to occur in some later reprints, e.g. in 1959. 
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respects the orientation on the sides). With respect to this orientation we may say 
that each side extends to a line that divides the plane into two regions, and the inside 
of the triangle that is on the same side of all three lines. This proof fails, however, 
even for quadrilaterals (precisely, for re-entrant quadrilaterals), but it highlights a 
relevant feature of the plane (that it is orientable). The argument is incorrect when 
applied to the projective plane or the Mobius band. 

For a polygon with a finite number of straight sides a popular informal proof asks 
us to consider a point that does not lie on any of the lines obtained by continuing 
any of the sides. Any half-line through such a point crosses the sides a finite number 
of times, and if we count the intersection points (counting any such vertices of the 
polygon as two points) we find that the inside of the polygon may be defined as 
those points for which this number is odd (and the exterior as the points for which 
the number is even). The remaining points, other than those that lie on a side, are 
assigned to the inside or the outside by looking at their immediate neighbours. But 
we need to be sure that we have not accepted this proof because we believe the Jordan 
curve theorem and go “inside, outside, inside, ...” as we cross each side until we 
escape. 

But what if the polygon has infinitely many straight sides? What if the curve 
is a rectifiable curve? Can we proceed by approximating it sufficiently accurately 
by polygons with finitely many sides—but surely the inside of an approximating 
polygon need not always be a subset of the inside of the curve? Matters are easier 
if the curve is differentiable, because then one may use segments of tangents of 
curves as sides of a suitable polygon. One can hope that differentiable curves have 
convex and concave sections—but what about curves that oscillate infinitely often? 
And non-differentiable but rectifiable curves and non-rectifiable curves are still out 
of consideration.” 

Jordan offered a proof that began with the polygonal case and addressed these 
difficulties. Some ten years later a second proof was offered by Oswald Veblen in 
1904, and this proof seems to have become generally accepted, although several 
different proofs have since been given. A consensus then grew up that Veblen’s 
was the first correct proof and that Jordan’s argument was either obscure in some 
way or even flawed, based perhaps on Veblen’s complaint that Jordan’s proof “‘is 
unsatisfactory to many mathematicians. It assumes the theorem without proof in the 
important special case of a simple polygon and of the argument from that point on, 
one must admit at least that all details are not given’”.* 

However, this consensus has recently been challenged by Thomas Hales, who 
consulted Jordan’s proof and looked for the alleged error, only to find that 


In view of the heavy criticism of Jordan’s proof, I was surprised when I sat down to read 
his proof to find nothing objectionable about it. Since then, I have contacted a number of 
the authors who have criticized Jordan, and each case the author has admitted to having no 


*Non-rectifiable curves make for other problems too: What is the integral in the Cauchy integral 
theorem for closed curves that do not have a length, such as the von Koch snowflake curve? 


3 Quoted in Hales (2007, 45) from which the next few paragraphs are drawn. 
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Fig. 29.1 Alexander’s 
horned sphere 


direct knowledge of an error in Jordan’s proof. It seems that there is no one still alive with a 
direct knowledge of the error. 


Hales then gives a modernised version of Jordan’s argument that preserves the 
essentials of his argument. 

The analogue of the Jordan curve theorem is also true in n dimensions, but the 
claim about topological simplicity of the regions is false, as the Alexander horned 
sphere shows. This was introduced to the world by the American topologist James 
Waddell Alexander II in his paper (1924) (Fig. 29.1). 

The construction of the horned sphere is an iterative one. It starts with a torus. 
A slice is removed from it, leaving a cylinder with two edges (@ and y facing each 
other. To each end of the cylinder a torus is attached, in such a way that the two tori 
are linked. The construction is now repeated on each torus ad infinitum, to produce 
a surface &. In Fig. 29.2, taken from Alexander’s paper, the left and right had ends 


Fig. 29.2 The construction of Alexander’s horned sphere 
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of the cylinder are labelled S$); S2 and $3 are the tori attached at the ends, and a hint 
is given of the next stage in the process. 
Alexander remarked (pp. 9, 10) 


It will be seen without difficulty that the interior of the limiting surface X is simply connected, 
and that the surface itself is of genus zero and without singularities, though a hasty glance 
at the surface might lead one to doubt this last statement. The exterior R of & is not simply 
connected, however, for a simple closed curve in R differing but little from the boundary of 
one of the cells 7; cannot be deformed to a point within R. It is easily shown, in fact, that 
the group of R requires an infinite number of generators. [...] 


This example shows that a proof of the generalized Schonfliess theorem announced by me 
two years ago, but never published, is erroneous. 


29.3 The Problem of Dimension 


Naively, curves and surfaces are different. The Cantor map shows that some care 
must be used to say how they are different, but a clue might be that the map is not 
continuous. In 1890, the Italian mathematician Guiseppe Peano caused a great stir 
with his example of a continuous map of the unit interval onto the plane, the first 
so-called space-filling curve. Peano had a particular distrust of geometric reasoning, 
which distrust his novel curve was intended to promote, the more comprehensible 
descriptions are, however, pictorial and due to David Hilbert in his (1891), later 
discussed by the American mathematician E.H. Moore in his (1900).* 

Start with the series of approximations shown in Fig. 29.3. 

Observe how straight and hooked segments are replaced as each square is divided 
into 4. The curve Hilbert produced is the limit of the series that is produced by 
iterating this construction. A better picture of it is given by the sixth step, which is 
shown in Fig. 29.4, taken from Sagan (1994, 11). 

With these examples in mind, it is worth noting the idea of a covering dimension 
of a space that was introduced by Lebesgue (1911) and later studied by many authors. 
It is the smallest number 7 such that any open cover of the space contains at least 
one point belonging to n + | sets of the covering. On this definition, a real manifold 
of dimensions n happily has covering dimension n. 


29.4 Osgood’s Theorem: Curves with Finite Area 


Here by area we mean 2-dimensional Lebesgue measure! The American mathemati- 
cian William Fogg Osgood introduced this curve in his (1903). It is described in 
Sagan (1994, 133-135), which should be consulted for the details. 


4Moore also discussed continuous curves with no tangents in the same spirit. 
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Fig. 29.3. The first three stages in the construction of Hilbert’s space-filling curve 


Fig. 29.4 The sixth stage in 
the construction of Hilbert’s . 
space-filling curve 
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Remove 2 bars horizontally and 2 bars vertically from a unit square, leaving 9 
squares, as shown in Fig. 29.5. If the widths of the bars are w then the side of each 
little square s; is given by 3s; + 2w = 1, so s; = (1 — 2w)/3 and the area of the 9 
squares is (1 — 2w)?. Now, from each little square, remove bars of width w/6, and 
get 81 squares. Note that at each stage in this part of the construction we select solid 
squares, not parts of an ‘honest’ curve. You should compute the total area of the 81 
squares. 

At each stage, join the 9 new squares as shown in Fig. 29.6. To get a curve, draw 
a curve that lies in each square as shown (and joins up to the next square)—note that 
the curve also lies in all the previous sets of squares, too. 

Osgood’s curve is the limiting curve as the process is repeated indefinitely. It is 
possible to parameterise it so to prove that it is a continuous Jordan curve. Nonethe- 
less, by choosing w to be arbitrarily small the curve has two-dimensional Lebesgue 
measure arbitrarily close to 1. 
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(0,0) (1,0) (0,0) (1,0) 


Fig. 29.5 The first two stages in the construction of Osgood’s squares 


Fig. 29.6 The first two 
stages in the construction of 
Osgood’s curve S355 


29.5 Concluding Remarks 


It will be clear by now that this last chapter belongs to the genre of final lectures, in 
which some of the ideas of the course are to run around briefly free of the constraint 
to flow clearly and yet rigorously from what has gone before, in the hope that they 
will excite the students. And indeed, these examples were chosen with that aim partly 
in mind. But they were also presented to show that analysis did not stop around 1900, 
and that the social and historical processes thorough which mathematics is done also 
continued. 

In a period when fractal sets and Hausdorff dimensions are much discussed the 
liveliness of the mathematics needs no emphasis. It is also worth noting that with 
every passing year the mathematical analysis of the last century also needs its histo- 
rians. 


Chapter 30 
Assessment 


On the view that end-of-course assessment ought to ask students to review their 
course and especially the later parts of the course, I tended to set rather straight- 
forward essay titles. A direct request for a survey of the century could only produce 
arushed composite of the material, so I chose to narrow the period to the period 1850— 
1900 or 1860-1900 (the difference being Riemann). I therefore asked students first to 
describe the main developments in mathematical analysis in the years 1850 (or 1860) 
to 1900, making clear what was important about them, and then to consider what 
were the most important problems or opportunities facing mathematical analysts as 
the 19th century came to an end. I added that the dates are approximate and may be 
varied by a few years if it helps their argument, but may not be narrowed. 
I also offered the following advice. 


Pick three or four interesting topics for each part of the essay, and look for moments 
when the nature of the subject changed, or the criteria for its importance shifted. 
Remember to consider who found it important, and why, and try to give their reasons, 
not yours (or, if you prefer, give their reasons as well as yours). Try to assess their 
influence and to look for dissenting voices, if there were any. 

The key words people use on such occasions (novel, fundamental, profound, rigorous) 
are often revealing: what developments merit what words? How and to what extent 
were judgements of this kind grounded in the mathematics? 

In the second part, do not let your knowledge of current mathematics lead you to 
suggest that the future could be predicted. The best answers will be grounded in what 
was said in the first part. 


As for the writing of the essay, a wealth of mathematical detail is not necessary, 
but mathematical accuracy is. 

Don’t end up with a final paragraph full of superlatives that don’t stand up to 
scrutiny. 

Finally, all manner of conclusions are OK—if well argued! 
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Two currents were at work: a wish that the students would gain some insight into 
why the subject they have spent so much time studying, and which can seem remote 
from practical and utilitarian concerns, has the shape and scope that it does; and a hope 
that they would see the growth to some extent from the inside, as a mathematician 
active in the 19th century might have done. If it is not seen as a succession of 
facts (definitions, theorems, examples, counter-examples, proofs) but as a creation 
of people engaged in various aspects of what is involved in making good sense of 
the greatest idea in modern mathematics —the calculus, broadly defined—then it can 
seem like a challenge. And if the course has managed to convey something of the 
messy, undirected, uncertain ways in which research is done and new mathematics 
discovered it may even be a resource for students after they have graduated and begun 
to work on their own. 

But now it seems to me that I could have been more adventurous, and set titles 
like these: 


e What is the importance of the concept of oscillation (in the values of a function) 
for the development of mathematical analysis in the period 1860-1910? 

e How true is it to say that by 1900 real analysis had become counter-intuitive? 

e Darboux began his address to the International Congress of Mathematicians in 
Rome in 1908 (Darboux 1909, 105) by saying in reference to complex analysis 
that “A profound change has been brought about before our eyes in the orientation 
of mathematical studies in the last 30 years”. To what extent would you say this 
was true of the period 1850-1880? 


Appendix A 
Papers in Translation 


A.1 Fourier on Fourier Series 


[From Fourier, The analytical Theory of Heat, tr. A Freeman. ] 
§220. We see by this that the coefficients a, b,c, d, e, f, &c., which enter into the 
equation 


1 
gree) =asinx + bsin2x +csin3x +dsin4x + &c, 


and which we found formerly by way of successive eliminations, are the values of 
definite integrals expressed by the general term [ sin kx@(x)dx, k being the number 
of the term whose coefficient is required. This remark is important, because it shews 
how even entirely arbitrary functions may be developed in series of sines of multiple 
arcs. In fact, if the function ¢(x) be represented by the variable ordinate of any 
curve whatever whose abscissa extends from x = 0 to x = 7, and if on the same 
part of the axis the known trigonometric curve, whose ordinate is y = sinx, be 
constructed, it is easy to represent the value of any integral term. We must suppose 
that for each abscissa x, to which corresponds one value of ¢(x), and one value of 
sin x, we multiply the latter value by the first, and at the same point of the axis raise an 
ordinate equal to the product (x) sin x. By this continuous operation a third curve is 
formed, whose ordinates are those of the trigonometric curve, reduced in proportion 
to the ordinates of the arbitrary curve which represents @(x). This done, the area of 
the reduced curve taken from x = 0 to x = 7 gives the exact value of the coefficient 
of sinx; and whatever the given curve may be which corresponds to (x), whether 
we can assign to it an analytical equation, or whether it depends on no regular law, it 
is evident that it always serves to reduce in any manner whatever the trigonometric 
curve; so that the area of the reduced curve has, in all possible cases, a definite value, 
which is the value of the coefficient of sin x in the development of the function. The 
same is the case with the following coefficient b, or f (x) sin2xdx. 
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In general, to construct the values of the coefficients a, b, c, d, &c, we must imag- 
ine that the curves, whose equations are 


y=sinx, y=sin2x, y=sin3x, y=sin4x, &c. 
have been traced for the same interval on the axis of x, from x = 0 to x = 7; and 
then that we have changed these curves by multiplying all their ordinates by the 
corresponding ordinates of a curve whose equation is y = ¢(x). The equations of 
the reduced curves are 


y=sinx d(x), y=sin2x d(x), y = sin3x d(x), &c. 


The areas of the latter curves, taken from x = 0 to x = 7, are the values of the 
coefficients a, b,c, d, &c., in the equation 


1 
BT) =asinx +bsin2x +csin3x +dsin4x + &c, 


§221. We can verify the foregoing equation (D), (Art. 220), by determining directly 
the quantities a), a2, a3, da, &c.., in the equation 


O(x) = ay sinx + a2 sin 2x + a3 sin3x + a4 sin 4x&c.; 


for this purpose, we multiply each member of the latter equation by sin kxdx, k being 
an integer, and take the integral from x = 0 to x = 7, whence we have 


[eo sinkx =a, / sinx sinkx + ao sin 2x sinkx + &c. 
+ aj / sin jx sinkxdx +--+ &c.; (A.1) 


Now it can easily be proved, Ist, that all the integrals, which enter into the second 
member, have a nul value, except only the term ax f sinkx sinkxdx; 2nd, that the 
value of f sinkx sinkxdx; a whence we derive the value of a;, namely 


= f ox) sinkxdx.a 
T 


The whole problem is reduced to considering the value of the integrals which 
enter into the second member, and to demonstrating the two preceding propositions. 
The integral 


2 f sin jx sinkxdx, 
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taken from x = 0 to x = 7, in which k and j are integers, is 


1 1 
- sin(k — j)x -sin(k + j)x +C. 
ee k—j 

Since the integral must begin when x = 0 the constant C is nothing, and the numbers 


k and j being integers, the value of the integral will become nothing when x = 7; it 
follows that each of the terms, such as 


ay f sins sin kx, an f sin2x sin kx, as [ sin3x sinkxdx&c. 


vanishes, and that this will occur as often as the numbers k and j are different. The 
same is not the case when the numbers k and j are equal, for the term _ sin(k — j)x 
to which the integral reduces, becomes 9 and its value is 7. Consequently we have 
thus we obtain, in a very brief manner, the values of a1, a2, a3,..., aj, &c., namely, 
a; = 2 f d(x) sinxdx, az = 2 f d(x) sin 2xdx 
a3 = * { d(x) sin3xdx, a4 = 2 f d(x) sin4xdx 


~~ 


Substituting these we have 
1 : : ; : 
anol) = sinx 7 (x) sinxdx + sin 2x / (x) sin2xdx + &c. 
+ sinkx 7 o(x) sinkxdx + &c. (A.2) 


§222. The simplest case is that in which the given function has a constant value for all 
values of the variable x included between 0 and 7; in this case the integral [ sin kxdx 
is equal to ;, if the number k is odd, and equal to 0 if the number k is even. 

Hence we deduce the equation 


1 1 1 1 
a =sinx + 3 sin 3x + 5 sin 5x + 5 sin 7x + &c., 
which has been found before. 
It must be remarked that when a function has been developed in a series of sines 
of multiple arcs, the value of the series 
asinx + bsin2x + csin3x +d sin4x + &c. 
is the same as that of the function ¢(x) so long as the variable x is included between 


0 and 7; but this equality ceases in general to hold good when the value of x exceeds 
the number 7. 
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Suppose the function whose development is required to be x, we shall have, by 
the preceding theorem, 


1 
aut — sin f sin xdx + sin 2x fx sin 2xdx + &c. 


+ sin kx | x sinkxdx + &c. (A.3) 


The integral Ih x sin kxdx is equal to + {; the indices 0 and which are connected 
with the sign [’, shew the limits of the integral; the sign + must be chosen when k is 
odd, and the sign — when k is even. We have then the following equation, 


1 1 1 1 
x = sinx — 5 Sin 2x + oe _ q sindx -f =e — &c. 


Nile 


§223. We can develope also in a series of sines of multiple arcs functions different 
from those in which only odd powers of the variable enter. To instance by an example 
which leaves no doubt as to the possibility of this development, we select the function 
cos x, which contains only even powers of x, and which may be developed under the 
following form: 


asinx +bsin2x + csin3x +d sin4x + esin5x + &c., 


although in this series only odd powers of the variable enter. 
We have, in fact, by the preceding theorem, 


1 
ao cosx = sinx | cosxsinxdx + sin2x | cosx sin2xdx 


+ sin 3x i cos x sin3xdx&c. (A.4) 


The integral [ cos x sin kxdx is equal to zero when k is an odd number, and to ah 
when k is an even number. Supposing successively k = 2, 4, 6, 8, etc., we have the 
always convergent series 


: Dai de sated de” ogee ies 
—7 COSX = — SIN ZX — sin4x — Sin 0x a3 
4 13 35 5 


or, 


ENE aS ween 4) andy (he! nee 
Cos == = — } sin = — J sin = — J sin ofe 
ee | Ria pal ae NS 4 ne 


This result is remarkable in this respect, that it exhibits the development of the 
cosine in a series of functions, each one of which contains only odd powers. If in the 
preceding equation x be made equal to iT, we find 
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la 1 a 1 leet a 3 & 
= Cc. : 
4/2 2\1 3 5 7 9 11 


This series is known ((Euler] /ntrod. ad analysin. infinit. cap. xX.). 

[In §224 Fourier showed that “a similar analysis may be employed for the develop- 
ment of any function whatever in a series of cosines of multiple arcs.” He concluded 
that “This and the preceding theorem suit all possible functions, whether their char- 
acter can be expressed by known methods of analysis, or whether they correspond 
to curves traced arbitrarily.” ] 


§230. If we apply these principles to the problem of the motion of vibrating strings, we 
can solve difficulties which first appeared in the researches of Daniel Bernoulli. The 
solution given by this geometrician assumes that any function whatever may always 
be developed in a series of sines or cosines of multiple arcs. Now the most complete 
of all the proofs of this proposition is that which consists in actually resolving a given 
function into such a series with determined coefficients. 

In researches to which partial differential equations are applied, it is often easy 
to find solutions whose sum composes a more general integral; but the employment 
of these integrals requires us to determine their extent, and to be able to distinguish 
clearly the cases in which they represent the general integral from those in which 
they include only a part. It is necessary above all to assign the values of the constants, 
and the difficulty of the application consists in the discovery of the coefficients. It is 
remarkable that we can express by convergent series, and, as we shall see in the sequel, 
by definite integrals, the ordinates of lines and surfaces which are not subject to a 
continuous law. We see by this that we must admit into analysis functions which have 
equal values, whenever the variable receives any values whatever included between 
two given limits, even though on substituting in these two functions, instead of the 
variable, a number included in another interval, the results of the two substitutions 
are not the same. The functions which enjoy this property are represented by different 
lines, which coincide in a definite portion only of their course, and offer a singular 
species of finite osculation. These considerations arise in the calculus of partial 
differential equations; they throw a new light on this calculus, and serve to facilitate 
its employment in physical theories. 


§235. It follows from that which has been proved in this section, concerning the 
development of functions in trigonometrical series, that if a function f(x) be pro- 
posed, whose value in a definite interval from x = 0 to x = X is represented by 
the ordinate of a curved line arbitrarily drawn; we can always develope this function 
in a series which contains only sines or only cosines, or the sines and cosines of 
multiple arcs, or the cosines only of odd multiples. [...] These trigonometric series, 
arranged according to cosines or sines of multiples of arcs, belong to elementary 
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analysis, like the series whose terms contain the successive powers of the variable. 
The coefficients of the trigonometric series are definite areas, and those of the series 
of powers are functions given by differentiation, in which, moreover, we assign to the 
variable a definite value. We could have added several remarks concerning the use 
and properties of trigonometrical series; but we shall limit ourselves to enunciating 
briefly those which have the most direct relation to the theory with which we are 
concerned. 

1st. The series arranged according to sines or cosines of multiple arcs are always 
convergent; that is to say, on giving to the variable any value whatever that is not 
imaginary, the sum of the terms converges more and more to a single fixed limit, 
which is the value of the developed function. 


Fourier’s Proofs 


Fourier deferred the proofs of his theorems largely to the end of his book, when he 
availed himself of infinitely large and infinitely small quantities, writing (in §417) 
of one proof that 


it supposes that notion of infinite quantities which has always been admitted by geometers. 
It would be easy to offer the same proof under another form, examining the changes which 
result from the continual increase of the factor p under the symbol sin p(a — x). These 
considerations are too well known to make it necessary to recall them. 


This was the proof of equation (B), which he went on to turn into the equation 
(B’) he referred to in the extract that follows: 


fos f eg) ieee: 
T J—oo a-—x 


He added 


Above all, it must be remarked that the function f(x), to which this proof applies, is entirely 
arbitrary , and not subject to any continuous law. 


418. The theorem expressed by Eq. (II) Art. 234 must be considered under the 
same point of view. This equation serves to develope an arbitrary function f(x) in 
a series of sines or cosines of multiple arcs. The function f(x) denotes a function 
completely arbitrary, that is to say a succession of given values, subject or not to 
a common law, and answering to all the values of x included between 0 and any 
magnitude X. 

The value of this function is expressed by the following equation, 


b ‘ 
f@=-> [ dafla)cos*(x—a) (A). 


423. The propositions expressed by Eqs. (A.) and (B’), Arts. 418 and 417, may be 
considered under a more general point of view. The construction indicated in Arts. 
415 and 416 applies not only to the trigonometrical function 
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sin(pa — px) 
a-x ” 


but suits all other functions, and supposes only that when the number p becomes 
infinite, we find the value of the integral with respect to a, by taking this integral 
between extremely near limits. Now this condition belongs not only to trigonomet- 
rical functions, but is applicable to an infinity of other functions. We thus arrive at 
the expression of an arbitrary function f(x) under different very remarkable forms; 
but we make no use of these transformations in the special investigations which 
occupy us. 

With respect to the proposition expressed by Eq. (A), Art. 418, it is equally easy 
to make its truth evident by constructions, and this was the theorem for which we 
employed them at first, It will be sufficient to indicate the course of the proof. 

In Eq. (A), namely, 


a—-x 


1 +X +00 
fas fe dap) Dees 7 : 


we can replace the sum of the terms arranged under the sign > by its value, which is 
derived from known theorems. We have seen different examples of this calculation 
previously, Sect. HI, Chap. III. It gives as the result if we suppose, in order to simplify 
the expression, 27 = X, and denote a by r,! 


i sinr 
> 5 cos jr = cos jr + sin jr ———. 

versin r 

-i 

We must then multiply the second member of this equation by da f (a), suppose the 
number j infinite, and integrate from a = —7 to a = +7. The curved line, whose 
abscissa is a and ordinate cos jri being conjoined with the line whose abscissa is a 
and ordinate f(a), that’ is to say, when the corresponding ordinates are multiplied 
together, it is evident that the area of the curve produced, taken between any limits, 
becomes nothing when the number j increases without limit. Thus the first term 
cos jr gives a nul result. 

The same would be the case with the term sin jr, if it were not multiplied by the 
factor ne re but on comparing the three curves which have a common abscissa a, 
and as ordinates sin jr 


sinr 
> versinr’ 


fa), we see clearly that the integral 


versin r 


: sinr 
[aor sin jr ——— 


has no actual values except for certain intervals infinitely small, namely when the 


ordinate ane becomes infinite. This will take place if r or a — x is nothing; and 


'versinr = 1 — cosr. 
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in the interval in which a differs infinitely little from x, the value of f(a) coincides 
with f(x)/. Hence the integral becomes 


ee or 1 Cdr. 
2f (x) dr sin jr=—, or =f (x) —sinjr, 
) xr? 2 0 i 


which is equal to 27 f(x), Arts. 415 and 356. Whence we conclude the previous 
Eq. (A). 

When the variable x is exactly equal to —7 or +7, the construction shews what 
is the value of the second member of the Eq. (A), [5 f(-7) or 5f(r)). 

If the limits of integrations are not —7 and +7, but other numbers a and b, each 
of which is included between —7 and +7, we see by the same figure what the values 
of x are, for which the second member of Eq. (A) is nothing. 

If we imagine that between the limits of integration certain values of f (a@) become 
infinite, the construction indicates in what sense the general proposition must be 
understood. But we do not here consider cases of this kind, since they do not belong 
to physical problems. 

If instead of restricting the limits —7 and +7, we give greater extent to the integral, 
selecting more distant limits a’ and b’, we know from the same figure that the second 
member of equation (A) is formed of several terms and makes the result of integration 
finite, whatever the function f(x) may be. 

We find similar results if we write 27 °>* instead of r, the limits of integration 
being —X and+X. 

It must now be considered that the results at which we have arrived would also 
hold for an infinity of different functions of sin jr. Itis sufficient for these functions to 
receive values alternately positive and negative, so that the area may become nothing 
when j increases without limit. We may also vary the factor aut ; as well as the 
limits of integration, and we may suppose the interval to become infinite. Expressions 
of this kind are very general, and susceptible of very different forms. We cannot 
delay over these developments, but it was necessary to exhibit the employment of 
geometrical constructions; for they solve without any doubt questions which may 
arise on the extreme values, and on singular values; they would not have served to 
discover these theor.ems, but they prove them and guide all their applications. 


A.2. Dirichlet on Fourier Series 


Dirichlet, On the convergence of trigonometric series which serve to represent an 
arbitrary function between two given limits. Journal fiir die reine und angewandte 
Mathematik, 1829, 4, 157-169, in Werke, I, 117-133. 

The series of sines and cosines, by means of which one can represent an arbitrary 
function in a given interval, enjoy among other remarkable properties that of being 
convergent. This property has not escaped the illustrious geometer who has opened 
a new domain in the applications of analysis, in introducing there the manner of 
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expressing arbitrary functions that is under consideration; it is found in the Memoir 
which contains his first researches on heat. But no one, as far as I know, has given a 
general demonstration. I only know one work on this topic, which is by M. Cauchy 
and which appeared in the Memoirs of the Paris Academy of Sciences for the year 
1823. The author of this work himself admitted that his proof was defective for certain 
functions for which however the convergence was indubitable. A close examination 
of the said memoir has led me to believe that the demonstration which is expounded 
there is not even sufficient in the cases to which the author believes it is applicable. 
Before entering into this matter I am going to state in a few words the objections 
to which the proof of M. Cauchy appears to me to be subject. The path which this 
famous geometer has taken in this research requires that one considers that values 
of the function ¢(x) that one wishes to develop, when one replaces the variable x 
in it by a quantity of the form u + v./—I. The consideration of these values seems 
foreign to the question, and in any case one does not easily see what one must 
understand by the result of a similar substitution when the function one is given 
cannot be expressed by an analytic formula. I give this objection with even more 
confidence because the author seems to share my opinion on this point. In fact, in 
several of his works he insists on the necessity of defining in a precise manner the 
sense one attaches to a similar substitution even when this is done to a function 
with a regular analytic law. Above all one finds in the Memoir which is published 
in volume 19 of the Journal de l’Ecole Polytechnique page 567 and following, some 
remarks on the difficulties that imaginary quantities give rise to when placed in the 
signs of arbitrary functions. What ever one makes of this first observation, there is 
still another objection to the demonstration that M. Cauchy gives that seems to leave 
no doubt about its insufficiency. The consideration of imaginary quantities leads the 
author to a result on the decrease of terms in a series that is far from proving that the 
terms form a convergent sequence. The result under considerataion can be stated as 
follows, supposing that the interval considered extends from zero to 7. 

“The ratio of the term of rank n to the quantity ASinnx (A being a determinate 
constant, depending on the extreme values of the function) differs from unity taken 
positively by a quantity which diminishes indefinitely as n becomes greater.” 

From this result and from the fact that the series which has A multe as its general 
term is convergent, the author concludes that the general trigonometric series is too. 
But this conclusion is not permissible, because it is easy to arrange that two series 
(at least when, as happens here, the terms do not always have the same sign) can be 
the one convergent and the other divergent, although the ratio of the two terms of the 
same rank differs as little as one wishes from unity taken positively when the terms 
are of a very high rank. 

One sees a very simple example in the two series, the one having the general 


(—1" (-1") (-l" ‘ r 
terms ae and the other =k. (1 + A ). The first of these series is convergent, 


the second on the contrary is divergent, because on subtracting the first from it one 
obtains the divergent series 
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1 
vn? 


and however the ratio of the corresponding two terms, which is | + converges 


to unity as 7 increases. 

I am now going to enter into the matter, beginning with an examination of the 
simplest cases, to which all the others can be reduced. Let us denote by h a positive 
number less than or at most equal to 7/2 and by f (() a function of ( that is continuous 
between the limits 0 and /; I understand by that a function that has a finite and definite 
value for every value of @ between 0 and h, and is also such that the difference 
f(6+ ¢) — f() decreases without limit when ¢ becomes smaller and smaller. 
Suppose finally that the function is always positive between 0 and / and that it 
decreases constantly from 0 to / in such a way that if p and g are two numbers 
between 0 and h, f(p) — f(q) always has the opposite sign to p — q. That done, let 


us consider the integral 
 sinkG 
—, f(8)a8 () 
o sin’ 


in which k is a positive quantity, and let us see what this integral becomes as k 
increases.? To do this, we divide it into several others, the first taken from @ = 0 to 
( = 7/k, the second from 3 = 7/k to 3 = 2, and so on, the penultimate one having 


the limits vbr and re and the last the limits re and h, rt denoting the greatest 
multiple of 7/k which is contained in h. It is easy to see that these new integrals, 
which are r + | in number, are alternately positive and negative, the function under 
the integral sign evidently being positive between the first limits, negative between 
the limits of the second, and so on. It is no less easy to convince oneself that each one 
is smaller than the one before, abstraction being made of sign.? In fact v denoting 
an integer less than r, the expressions: 


ua/k : (V+1)r/k o 
[ ink eayap, | sink? eayap 


v—l)r/k sin w/k sin 3 


represent two consecutive integrals. In the second, let us replace 3 by 7/k + (3; the 
integral then changes into this: 


i k sin (kG +7) 
( 


TT 
v—l)n/k Sin (B+ 1/k) is ral 


or, which comes to the same thing 


Uae sink 
= eas area +7/k)dp. 
— sin Gia” ee 


?Dirichlet wrote i for k. 
3The 19th Century phrase for ‘in absolute value’. 
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The two integrals which we thus wish to compare having therefore the same limits, 
one sees without difficulty that the second has a numerical value less than the first. 
For that it is sufficient to note that it follows from the supposition made about the 
function f (3): 


f(E+8) < sO. 


and that on the other hand 


sin (= + 8) > sin GZ, 
the arcs 3 and ¢ + 3 each being less that 7/2, from which the inequality follows 


that: 
f(B) a f(@+ %) 
sin@~ sin(@+7/k)’ 


which holding for all values of 3 between the limits (v — 1)% and vx/k shows, as 
we have said, that each integral is bigger than the one before, abstraction being made 
of sign. This circumstance holds a fortiori when one compares the penultimate one 
with the last one, noting that the difference of the limits re and h of the last is less 
than 2/k, the common difference of all the others. 

Now let us examine in a little more detail the integral of rank v, which is: 


va/k ink 
/ sink? eas. 


v-1)n/k sin B 


as the function of @ which is found under the integral sign is a product of factors 
snk and f (3), which are both continuous functions of @ between the limits of 
integration, and as on the other hand the first of these factors always has the same 
sign between these same limits, one concludes, in virtue of a known theorem, that 
the integral under consideration is equal to the integral of the first factor multiplied 
by a quantity lying between the greatest and the least value of the other factor. As 
the second factor decreases from the first limit to the second, the quantity under 
consideration lies between f((v — 1)Z) and f(v7/k). Denoting this by p,, our 
integral will be equivalent to: 


vm/k sink 
| ae 


Ane sme 


the integral which is still contained in this expression depends both on v and k. It is 
positive or negative according as 1 — | is even or odd; we shall nonetheless denote 
it by K,, abstraction being made of sign. We shall however need to know the limit 
to which this converges when, v remaining constant, k becomes greater and greater. 
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To discover this limit, let us replace G by y/k, y being a new variable. We shall then 


have: ' 
- sin y dept 
Tn aY/ K- 
(v—1)r Sin(y/k) 


In this form, it is evident that it converges to the limit 


vT™ sin 
/ tay, 
(v-l)r 


which, to abbreviate, we denote by k,, abstraction being made of sign. 

One knows that the integral [)° “7 has a finite value and is equal to 7/2. This 
integral can be divided into an infinity of others, the first taken from y = 0 toy = 7, 
the second from y = 7 to y = 27, and so on. These new integrals are alternately 
positive and negative, each of them has a numerical value less than the one before, 
and that of rank vy is k,, abstraction being made of sign. The proposition we have 
cited reduces therefore to saying that the infinite sequence: 


ky —ko+k3 —ka tks —etc. (2) 


is convergent and has a sum equal to 7. 

The terms of this sequence always decrease, and it follows from a known propo- 
sition that the sum of the first n terms is greater or less than 7/2 according as n is 
odd or even, and that this sum, which one can denote by S,,, differs from 7/2 by a 
quantity less than the following term ky+1. 

Let us now return to integral (1) and seek to find the limit to which it converges 
as k increases indefinitely. In making the number k thus increase, the integrals into 
which we have decomposed integral (1) change their values continually and at the 
same time their number will increase; it is necessary to know the result of this double 
change when it is continued indefinitely. For this we take an integer m (which we shall 
suppose is even for simplicity) and suppose that m remains fixed while k increases. 
The number r, which increases continually with & will eventually finish by surpassing 
the fixed number m, however large it is chosen. 

This done, we divide the integrals into two groups whose sum is equivalent to 
integral (1). The first group will contain the first m of these integrals, the second will 
be composed of all that follow. One has, for the sum of the first group 


Ki p1 — K2p2 + K3p3 — Kapat+---— Kmpm (3) 


and the second, the number of whose terms increases continually with k has for its 
first terms: 
Km4iPm+1 — Km+2Pm+42 +°°: (4) 


Let us consider these two groups separately. As the number k increases indef- 
initely, the sum (3) converges to a limit that it is easy to determine. In fact, the 
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quantities ~1, 92,..., Pm, which are contained the first between f (0) and f (%), the 
second between f({) and f(2{), and the last between f((m — 1)Z) and f(m{) 
each converge to the limit f (0) as, m remaining fixed, k increases without limit. On 
the other hand we have seen that the quantities: 


Ky, Ko,..., Km 
converge in the same circumstances to the respective limits: 

ki, ka,..., km. 
Therefore the sum (3) converges to the limit: 


(ky — ka + k3 —-+»—km) f(O) = Sm f(O), 


which is to say that the difference between the sum (3) and S,, f(O) will always 
finish, abstraction being made of sign, by being constantly less that w, w denoting a 
positive quantity as small as one wishes. 

Let us now consider the sum (4) the number of terms of which increases contin- 
ually. These terms being alternately positive and negative and each of them having 
a numerical value less than the term before, as we have seen above, and considering 
the integrals that the terms represent, if follows from a known principle* that this 
sum, whatever the number of terms, is positive like its first term Ky+1$m+1and has 
a value less than this term. Now, this first term converging to the limit k,,+  f (0), it 
follows that the sum (4) will always be less than kj,+1 f (0) increased by a positive 
quantity w’ as small as one wishes. In combining this result with the one we obtained 
for the sum (3) only a moment ago, one sees that integral (1), which is the sum of 
expressions (3) and (4) will always finish by differing from S,, f (0) by a quantity less, 
abstraction being made of sign, than w+w’ +k»+1 f (0), w and w” being quantities of 
arbitrary smallness. On the other hand S,, differs from 5 by a quantity numerically 
less than k,, + 1; therefore the integral will always finish by differing from 5 f (0) 
by a quantity less than w + w’ + 2kin+1 f (0), abstraction being made of sign. 

As m can be chosen so great that kj, will be less than any given quantity, 
it follows that integral (1) will always finish when k increases without limit, by 
constantly differing from (7/2) f(0) by a quantity less, abstraction being made of 
sign, than any number as small as you wish. It is therefore proved that integral (1) 
converges to the limit (7/2) f (0) for increasing values of k. 

Let us now suppose that the function f (3), instead of always decreasing from 0 
to h, is constant and equal to unity. In this case one can determine the limit to which 
integral (1) converges by the same considerations as we have already used; one can 


‘The principle that we are going to apply can be stated in this way. The letters A, A’, A”,... 
denoting an arbitrary number of positive quantities such that A > A’ > A” etc., the quantity 
A—A’+A"”— A” + etc. is positive and less than A. This follows immediately from the fact that the 
preceding quantity can always be put in one of the two following forms: (A— A’)+(A” — A”) +--+ 
or A — (A! — A”) — (A” — Al”) ee, 
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see this at once by recalling that the preceding demonstration is based on the fact 
that the integrals into which we have decomposed integral (1) form a decreasing 
sequence. Now this decrease depends on two things, the decrease in the factor f(() 
and the increase in the divisor sin 7. If f (3) becomes a constant number, the increase 
in sin 3 will always suffice to make each integral in the series smaller than the one 


before. One thus finds, always supposing h to be positive and at most equal to 5 


that the integral 1 sinks 


h sink : ‘ . re * boos T 
i ain B d3, in which c is a positive or negative constant, converges to the limit c7. 


We have supposed that the function f (() was decreasing and positive between the 
limits 0 and h. The first circumstance continuing to hold, that is to say the function 
being such that f(p) — f(q) has the opposite sign to p — q for values of p and q 
between 0 and h, let us suppose that f (3) is not always positive. One can take a 
positive constant c so great that c+ f (3) always takes a positive sign from 3 = 0 


to 3 =h. The integral is sinks d being equal to the difference of these: 


h sink h sink 
[et+t@ a8 [eas 


the limit will be the difference of the limits to which these latter converge. Now 
these latter belong to the case previously examined (c + f((@) being a decreasing 
and positive function) and converge to the limits [c + f (O)1F and ae whence the 
first converges to the limit 5 f (0). 

Let us now consider a function f() increasing from 0 to h. In this case — f (3) 


sink@ : 
aa G will therefore converge to 


the limit —4 f (0), and consequently the integral i aid G to the limit 5 f (0). 
Putting these results together. One has this statement: Whatever be the function 

J (8), provided that it remains continuous between the limits 0 and h (h being positive 

and at most equal to ak and whether it increases or decreases from the first of these 


limits to the second, the integral i, f(2) a 
from 5 f (0) by a quantity less than any number assignable when one lets k increase 
beyond any positive limit. 

Let us denote by g a positive number different from 0 and less than h, and let 
us suppose that the function remains continuous and increases or decreases from 
g to h. The integral i f(B nk. d3 will then converge to a limit that it is easy to 
discover. One can proceed by considerations analogous to those that we have applied 
to integral (1); but it is simpler to reduce this new case to those we have considered in 
what has preceded. The function being given only from g to h it is entirely arbitrary 
for values of G between 0 and g. Suppose that one understands by f (3), for values 
of 3 between 0 and g, a continuous function that increases or decreases from 0 to g 
according as f (3) increases or decreases from g to h; suppose moreover that f (g—¢) 
differs infinitely little from f(g + €) if ¢ decreases without limit; having satisfied 


these conditions in an arbitrary way, which one can always do in infinitely many 


d@ converges to the limit 5. It follows that the integral 


will be a decreasing function. The integral rs 


d( will finish by constantly differing 
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ways, the function f (3) satisfies from 0 to / the conditions expressed in statement 
(5). The integrals: 


9 sink sink@ 
[rota [soe Sas 


will therefore converge, both of them, to the limit 5 (0). Whence one concludes 
that the integral ic * £(B) snk? d 3, which is the difference of the preceding, has zero 
as its limit. 

This new result can be put together in a single statement with the one we have 
obtained above. One will then have: The letter / denoting a positive quantity at most 
equal to 4 and g being a quantity that is also positive and in addition less than /, the 


integral 
i one 
[ oe 


in which the function f(() is continuous between the limits of integration and is 
either always increasing or always decreasing from 3 = g to 3 = h, will converge 
to a certain limit when the number k becomes greater and greater. This limit is equal 
to zero except in the case when g has the value null, in which case it has the value 
= F(0). 

It is evident that this result will only be lightly modified if the function f(@) 
interrupts the continuity at G = g or 6 = h, that is to say if f(g) were different from 
f(g +e) and f(h) from f(h — ¢), € denoting an infinitely small positive quantity, 
provided that the values f(g) and f() do not become infinite. It is only necessary 
in this case to replace f (0) by f(e) in the preceding statement, considering that f (€) 
is equal to f(0). 

We are now ready to prove the convergence of periodic series which express 
arbitrary functions between given limits. The path which we shall follow will lead 
us to establish the convergence and at the same time to determine their values. Let 
(x) be a function of x, having a finite and definite value for each value of x lying 
between —7 and 7, and suppose that it is required to develop this function in a series 
of sines and cosines of multiple arcs of x. The series which solves this question is, 
as one knows: 


~ / d(a)da 


1 [ cosx [ ¢(a) cosada + cos 2x | (a) cos2ada +--- (7) 
x | sinx f $(a) sinada + sin2x f ¢(a)sin2ada+--- |’ 

the integrals which determine the constant coefficients being taken from a = —7 to 
a = 7 and x denoting an arbitrary quantity lying between —7 and 7. (Théorie de la 
Chaleur No. 232 ff.). 
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Let us consider the first 2n + | terms of this series (n being an integer) and see 


to what limit the sum of these terms converges when n becomes greater and greater. 
This sum can be put in the following form: 


~ o(a)da E + cos(a — x) + cos2(a@— x) + + cosn(a—3)| ; 
T Jan 


or, on summing the sequence of cosines: 


m sin(n + 5)(a — x) 


1 
= (a) 
WT Jar 


da. (8 
2sin 5(a — x) ee 


All now reduces to determining the limit to which this integral continually 
approaches when n increases indefinitely. For this we divide it into two parts, the 
one taken from —z to x, the other from x to 7. If one replaces a in the first one by 
x — 2, and in the second a by x + 2, @ being a new variable the two integrals 
change into these, abstraction being made of the factor +: 


(w+x)/2 
| anne jG Ode. 
0 sin 3 
(n-)/2 «i 
[ Sinn + OP be 28aB. @) 
0 sin 3 


Let us consider the second of these two integrals. The quantity x being less than 
or equal to 77, abstraction being made of sign, (7 — x)/2 cannot fall outside the limits 
0 and a. If (7 — x)/2 = 0, which happens when x = 7, the integral is null whatever 
n is; in all other cases it will converge for increasing values of n to a limit that we 
are going to determine. Let us suppose first of all that (7 — x)/2 is less than or at 
most equal to 73 and let us note that the function ¢(x + 2) can have several breaks 
in continuity from 3 = 0 to 8 = (a — x)/2, and that it can also have several maxima 
and minima in this same interval. Let us denote by @, £’, @”,..., @™, ranged in order 
of size, the different values of ( which arise at one or other of those circumstances, 
and let us decompose our integral into several others taken between the limits: 


1 
O and @, @ and ¢’, é’ and ¢”,..., 2 and a —x). 


All these integrals occur in the case of statement (6). They all therefore converge 
to the limit zero as n increases, with the exception of the first which converges to the 
limit 5 (x + e), € being an infinitely small positive number. 

If 5(0 — x) is greater than 5, which will happen when x has a negative value, one 
divides the integral into two others, one taken from 3 = 0 to 3 = 4, the other from 


B=Ftos= 5(0 — x). The first of these two integrals is one that we have already 
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considered, it will therefore converge to the limit F(x +e). As for the second, it 
can be changed into this, on replacing @ by 7 — 4, y being a new variable: 


T 


2 sin(2n + 1)(7 — 
| (eon oy 
(w+x)/2 sin(t — ¥) 


or, which comes to the same thing, n being an integer: 


T 


i. Oe a sa al ELE 
(w+x)/2 sin(7) 


It thus has a form analogous to the preceding, and decomposing it like the preceding 
into several others, one sees that it will converge to the limit zero, the sole case 
excepted where Gs + x) has the value null, that is to say when x = —z; in this 
case it continually approaches the limit [5] o(m — €), € always having the same 
signification. In summarising all that has gone before, we find that the second integral 
of (9) is zero when x = 7, that it converges to the limit 7 [o(m—€)+6(—7+¢)] when 
x = —n, and that in all other cases it continually approaches the limit 5 (x + ¢). 

The first of the integrals (9) is entirely analogous to the second; on applying similar 
considerations to it, one finds that it is null when x = —z7, that it converges to the 
limit 5lo(a —e)+0(—a+€)] when x = 7 and that in all other cases it has the limit 
5(x — €). Knowing in this way the limits of each of the integrals (9), it is easy to 
find the limit that integral (8) continually approaches when n becomes greater and 
greater. It is sufficient for this to recall that each integral is equal to the sum of the 
integrals (9) divided by 7. Now, integral (8) being equivalent to the sum of the first 
2n + | terms of the series (7), the convergence of this series is proved, and one finds 
by means of the preceding results that it is equal to 


1 
3 lo —e)+o@+8)] 


for every value of x between —7 and 7, and that for each of the extreme values pi 
and 7 it is equal to 


1 
5 lotr —e)+ o(-7 + 6¢)]. 


The preceding exposition embraces all cases; it simplifies when the value of x is 
not one of those that presents a break in continuity. In fact, the quantities @(x + €) 
and ¢(x — €) then each being equivalent to ¢(x), one sees that the series has the 
value d(x). 

The preceding considerations prove in a rigorous manner that, if the function (x), 
whose values are supposed finite and definite, has only a finite number of breaks of 
continuity between the limits —7 and 7, and if moreover it has only a definite number 


5The factor [5] was omitted by Dirichlet. 
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of maxima and minima between these same limits, then series (7), whose coefficients 
are definite integrals depending on the function ¢(x), is convergent and generally 
has the value expressed by 


1 
lo —e)+ oe +e)], 


where ¢€ denotes an infinitely small number. It remains to consider the case where 
the assumptions that we have made about the number of breaks in continuity and on 
the natural number of maxima and minima cease to hold. These singular cases can 
be reduced to those that we have considered. It is only necessary for series (8) to 
make sense when the breaks in continuity are infinite in number, that the function 
@(x) satisfies the following condition. 

It is then necessary that the function @(x) be such that, if one denotes by a and b 
two arbitrary quantities lying between —7z and 7, one can always find between a and 
b two quantities r and s so close that the function is continuous in the interval from 
r to s. One easily appreciates the necessity of this restriction on considering that the 
different terms of the series are definite integrals and going back to the fundamental 
notion of integrals. One will then see that the integral of a function only signifies 
something when the function satisfies the condition previously stated. One will have 
an example of a function that does not satisfy this condition, if one supposes that d(x) 
is equal to a definite constant c when the variable x has a rational value, and is equal 
to a different constant d when this variable is irrational. The function thus defined 
has finite and definite values for each value of x, however, one cannot substitute it in 
the series because the different integrals which enter this series lose all signification 
in this case. The restriction which I have made precise, and that of not becoming 
infinite, are the only ones to which the function ¢(x) must be subjected, and all the 
cases that we have not excluded can be reduced to those we have considered in the 
preceding. But this matter, to be done with all the clarity that one desires, requires 
some details related to the fundamental principles of analysis, and will be expounded 
in another note, where I will also consider some other quite remarkable properties 
of series (7). 


A.3 Riemann on Elementary Complex Function Theory 


G.B.F. Riemann, Foundations for a general theory of functions of a variable com- 
plex quantity. Inauguraldissertation, Gottingen, 1851, second unaltered publication, 
Gottingen, 1867. 

§ 1. 


®In making this translation, I consulted the one in A Source Book in Classical Analysis, G. Birkhoff 
and U. Merzbach (eds.) 1973, pp. 48-50, and some phrases of that translation appear here. 
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If one thinks of z as a variable that successively takes all possible real values, then 
if to each value of z there corresponds a single value of w, w is called a function 
of z. If, when z varies continuously on an interval, w also varies continuously, this 
function is called continuous on this interval. 

This definition evidently does not assume any fixed law describing the function 
since, once this function has been defined on a given interval, it may be extended 
arbitrarily outside it. 

The dependence of w on z may be given by a mathematical law which determines 
from each value of z the corresponding value of w. Formerly only certain kinds of 
functions (functiones continuae in Euler’s terminology) were considered capable of 
satisfying the same law of dependence for all values of z in a given interval. Recent 
researches have shown, however, that there exist analytic expressions by which any 
continuous function may be represented on a given interval. It is therefore all the 
same whether the function is defined arbitrarily or by a formula. Because of the 
theorem recalled above, the two concepts are equivalent. 

However, it is otherwise when z is not restricted to real values, and complex 
numbers of the form z = x + iy (where i = /—1) are included. 

Let x + yi and x + yi + dx + dyi be two values of the quantity z differing 
infmitesimally, to which correspond the values u + iv and u + vi + du + dvi of the 
quantity w. Then if the dependence of the quantity w on z is taken to be arbitrary ... 


the ratio ners will in general vary with dx and dy. For, if one sets dx +dyi = ce®, 
then 


du+dvi 1/f0u Ov 1fov O 
dx +dyi =3(K+R)+3(F ) 
1f[Ou Ov ) 

+35 a+ (> =) 


v 
+3) 
dx—dyi 1 aks 1f/Ov Ou). 
dx+dyi 2\@x dy 2\0ox Oy 

+ )‘| er, 


Ou 

Oy 

a! Ou an Ov Ou 
20x Ody Ox Oy 
But no matter how w is determined as a function of z by tying together simple 


algebraic operations the value of the derivative [or differentialquotient] ae must 


always be independent of the particular value of dz.’ Evidently, not every dependence 
of the complex quantity w on the complex quantity can be expressed in this way. 
This characteristic property of all functions defined by explicit operations, will 
be taken as basic in what follows, where a function will be considered independent 
of its expression, and we, without proving the general validity and permissibility of 


7This requirement is evidently satisfied in all cases where from the expression for w in terms of z 


an expression for du in terms of z is obtained by means of the rules for differentiation; the rigorous 


general validity of this remains for now undiscussed. 
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expressing every dependence through operations on quantity, shall proceed from the 
following definition: 

A variable complex quantity w is called a function of another variable complex 
quantity z when its variation is such that the value of the derivative ue is independent 
of the value of the differential dz. 


§ 2. 


Let the quantities z and w be complex be taken as variables that can take each take 
complex values. The presentation of their variation on a connected two-dimensional 
domain is made essentially easier by a connection to spatial intuition. One thinks 
of each value x + iy of the quantity z as represented by the point O of the plane 
A with rectangular coordinates (x, y), and each value u + vi by the point Q of the 
plane B with rectangular coordinates (u, v). Every dependence of the quantity w on 
z will then be represented by the dependence of the position of the point g on that 
of the point O. If to each value of z there corresponds a value of w, which varies 
continuously with z, in other words, if u and v are continuous functions of x and 
y—then to each point of the plane A will correspond a point of the plane B, every 
curve generally to a curve, and every connected piece of surface to a connected piece 
of surface. One can think of this dependence of the quantity w on the quantity z as 
a mapping of the plane A onto the plane B. 


§ 3. 


We now investigate what properties this mapping has when w is a function of the 
complex quantity z, that is when oe is independent of dz. We denote by o be an 
arbitrary point of the plane A in the ‘neighbourhood of O, and by q its image in the 
plane B, also, let x + yi+dx-+dyi andu+vi +du-+duvi be the values of z and w 
at these points. Then dx, dy and du, dv can be regarded as rectangular coordinates 
of the points o and gq relative to the points O and Q considered as origins, and if one 
writes dx +dy = ce” and du+dvi = ne“ then the quantities ¢, d, 1, w will be the 
polar coordinates of these points for the same origins. Now if o’ and o” are any two 
specified locations of the point o infinitely near to O, and if one denotes the other 
variables by the corresponding indices, then we have 


du! +dv'i — du" +dv'i 
dx'+dyli dx" + dy"i 


and consequently 


du'+dvi _ 1 wi dx' + dy'i = e! oi 


du" + dv"i — nl! = dx" + dy"i el 
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t 


whence an a g and w’ — wo” = d — ¢", that is to say, in the triangles 0’ Oo” 
and q'Qq” the angles o’Oo” and q’Qq” are equal and the sides enclosing them 
proportional. 


Thus there exists equality in the smallest parts between two corresponding infin- 
itesimal triangles and consequently between the plane A and the lane B in general. 
An exception occurs only in the special case when the corresponding variations in 
the quantities z and w does not have a finite ratio, which is here tacitly assumed. 


§4. 


du+dvi 


If one writes the derivative 7— a in the form 


(34 + Hi) dx + (3 - Hi) avi 
dx + dyi 


it is clear that its value will be independent of dx and dy if and only if 


These conditions are therefore necessary and sufficient for w = u + vi to be a 
function of z = x + yi. Consequently the individual parts of this function satisfy the 
following 


Cu Ou Ov &v 


aa® * Bye BRP By?” 


which will be taken as fundamental for the study of the properties of the individual 
parts of such a function, considered separately. 


A.4 Riemann’s Definition of the Integral 


From Riemann On the representability of a function by a trigonometric series: The 
concept of a definite integral and the scope of its validity. 


§4 


The indeterminancy which still prevails on a number of fundamental points of the 
theory of definite integrals compels us to make some preliminary remarks about the 
concept of a definite integral and the scope of its validity. 

So first: what have we to understand by [ : f(x)dx? 
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To determine this, we take a sequence of values x1, x2,..., X,—1 increasing in 
size between a and b, and for brevity denote x; — a by 61, x2 — x1 by 62,...,b—Xn_1 
by 6,, and by € a positive real fraction. Then the value of the sum 


S = 61 f (a +€161) + 62 f (41 + €252) + 63 f (x2 + €363) +... +41 f n-1 + Endn) 


depends on the choice of intervals 6 and the quantities ¢. If this now has the property 
that however 6 and € may be chosen, as soon as all 6’s become infinitely small, it 


approaches infinitely close to a fixed limit A, then this value is called [ : St (x)dx. 


If it does not have this property, then [ ia J (x)dx has no meaning. 

However, in several cases, attempts have been made to attribute a meaning to this 
symbol, and among these expansions of the concept of a definite integral there is 
one that is accepted by all mathematicians. To wit, if the function f(x) becomes 
infinitely large as the argument approaches a particular value c in the interval (a, b), 
then obviously the sum S, whatever degree of smallness we attribute to the 6’s, can 
obtain any value whatsoever. Thus it has no limiting value, and f Ff (x)dx, as above, 
would have no meaning. But, should 


cay b 
| Flxdx + | Flx)dx, 


+02 


when a, and a2 become infinitely small, approach a fixed limit, i, J (x)dx is under- 
stood to be this limit value. 

Other statements by Cauchy about the concept of the definite integral in cases 
where, under the basic concept, one does not exist may be useful for certain classes 
of investigations. However, they were not introduced at a general level and, given 
their great arbitrariness, are scarcely appropriate. 


§5 


Second, let us now look at the scope of validity of this concept, or at the question: 
in which cases is a function integrable and in which not? 

We shall first look at the concept of integral in the strict sense, i.e., we assume that 
the sum S converges when all the 6’s become infinitely small. If we call the greatest 
variation of the function between a and x1, i.e., the difference between its largest and 
smallest values in this interval, D,, that between x; and x2, Dz,..., that between 
X,—1 and b, Dy, then 

61D, + 62D2 +--+ +0nDn 


must become infinitely small with the quantities 6. We further assume that, as long 
as all 6’s remain smaller than d, the greatest value that this sum can have is A; A 
will then be a function of d that is always decreasing along with d and becomes 
infinitely small with this quantity. Now if the total size of the intervals in which the 
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variations are greater than o is = s, the contribution these intervals make to the sum 
61D, + 62D2 +--+ +6, Dy is evidently > os. One therefore has 


os = 61D, + 62D. +---+dnDn < A, 
consequently s < 4. 

Now 4, when ga is given, can always be made infinitely small by a suitable choice 
of d; hence the same is true of s, and this results in: 

For the sum S to converge as all 6’s become infinitely small, not only does the 
function f(x) need to be finite, but also the total size of the intervals in which the 
variations are > o (whatever o may be) must be able to be made arbitrarily small by 
a suitable choice of d. 

This theorem can also be inverted: 

If the function f(x) is always finite and, as all the quantities 6 decrease the total 
size s of those intervals where the variations of function f(x) are larger than a given 
value o continually becomes infinitely small, then the sum S will converge as all 6’s 
become infinitely small. 

For, those intervals where the variations are > o contribute an amount smaller 
than s to the sum 6 Dj +62 D2+---+6, D, multiplied by the greatest variation of the 
function between a and b, which (by hypothesis) is finite. The remaining intervals 
contribute an amount < o(b — a). Evidently one can first assume c to be arbitrarily 
small and then always (by hypothesis) so determine the size of the intervals that s 
also becomes arbitrarily small, whence the sum 6; D; + 62D2 +--+ + 6,D,y can 
be given any arbitrary smallness one wishes, and consequently the sum S' can be 
contained within arbitrarily narrow limits. 

Thus we have found conditions that are necessary and sufficient for the sum S to 
converge as the values of 6 infinitely decrease and so we can speak in the strict sense 
of an integral of the function f(x) between a and b. 

If the concept of the integral is now expanded as above, it is evident that for 
integration always to be possible, the latter of the two conditions we found is still 
necessary; but in place of the condition that the function always be finite comes 
the condition that the function only becomes infinite as the argument approaches 
particular values, and that a determinate limiting value emerge as the integration 
limits are approach infinitely close to this value. 


A.5 Schwarz on Squaring the Circle 


From H.A. Schwarz, Ueber einige Abbildungsaufgaben, Journal fiir die reine und 
angewandte Mathematik vol. 70 (1869), pp. 105-120; in Gesammelte Mathematische 
Abhandlung vol. 2 (1890), pp. 65-83, this extract pp. 66-70.° 


8This translation is adapted from the one in A Source Book in Classical Analysis, G. Birkhoff and 
U. Merzbach (eds.) 1973, pp. 56-59. 
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[Schwarz first commented on the striking contrast between Riemann’s general 
mapping theorem and the absence of specific formulas for mapping triangles (say) 
on the unit circle. He then began to solve that problem as follows. ] 

The following fruitful theorem leads to the solution of this and many other map- 
ping problems: If to a continuous sequence of real values of the complex argument 
of an analytic function there corresponds a continuous sequence of real values of 
the function, then to any two conjugate values of the argument there correspond 
conjugate values of the function. In the u-plane, whose points represent the values of 
a complex variable u geometrically, let U’ be a bounded, simply connected domain 
whose boundary contains a finite segment ¢ of the real axis. 

Let a single-valued analytic function ¢ of the complex variable uv, t = f(u) behave 
like an entire function for all values of wu in the interior of U’. That is, if wo denotes 
an arbitrary value of u in the interior of the domain U’ then the function f (uw) can be 
expanded in a power series in uv — uo for the values of u lying in a neighbourhood of 
ug that converges for all sufficiently small absolute values of u — ug. We shall assume 
that f remains bounded as u approaches the boundary @ and is real for all points of 
the line @, and that for all values of the argument u in the interior and the boundary 
of the domain U’ the function t = f(u) varies continuously with the argument w. 

To the domain U’ there corresponds a domain U” whose points lie symmetrically 
with the points of U’ with respect to the real axis. 

At all points of the domain U” an analytic function t is defined that in the domains 
U’ and U” takes conjugate values at conjugate values of u in the domains. If one 
thinks of the two domains U’ and U” as joined along the segment £, then one obtains 
a simply connected domain U’ + U”. For all values of the argument u in the interior 
of U’ + U”, the value of ¢ is uniquely defined; and indeed in the interior of U’ 
and the interior of U” as an analytic function of this argument which behaves like 
an entire function. On crossing the line @, and along that line, the value of t varies 
continuously. It follows from this that the function ¢ determined for the domain U” is 
an analytic continuation of the function determined for the domain U’, and is indeed 
a continuation extending beyond the line ¢. The proof of the validity of this statement 
can be carried out as follows if, as may be assumed, the domain U’ + U” everywhere 
covers the u-plane only simply. 

If uo denotes a value of u in the interior of U’ then, by a theorem of Cauchy, the 
integral 


1 flu) 


2Ti u— uo 


du 


taken in a positive sense around the boundary of the domain U’ or that of U”, has 
the value f (uo) in the first case and the value 0in the second. On adding these two 
integrals, the integrations along the line @, carried out twice and in opposite senses, 
cancel out; and the following equation holds 


1 
fuga. | 22 


Qui u— ug 


du, 
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where the integral is taken along the boundary of the domain U’ + U”, and for all 
values of the quantity wo that are among the points represented geometrically in the 
interior of these domains, represents a continuous function of this argument whose 
values coincide everywhere with the values of the function t = f(u). 

It follows that the function so determined also behaves like an entire function 
for all values in the interior of the segment ¢. Thus, under the assumptions made, 
conjugate values of the argument correspond to conjugate values of the function, 
or, expressed geometrically, the conformal mapping of the (w)-plane onto the (t)- 
plane, whose points represent the values of the complex quantity rf, is symmetric 
with respect to the real axis for both planes: to symmetric points there correspond 
symmetric images. 

If one analytically continues the function t = f (uw), symmetrically on both sides 
of the real axis in the (u)-plane, one is led to the result that singular points of some 
kind lie either singly on the real axis or in pairwise symmetry on both sides of 
it. This theorem can immediately be extended to an analytic function which maps a 
straight line segment in the domain of the argument or on its boundary onto a straight 
line segment in the plane whose points represent the value of the analytic function 
geometrically. 

For the special problem of mapping the surface of a square onto the surface of 
a disk, one easily guesses that, to make the centre of the square correspond to the 
centre of the disk, the four straight lines that are symmetry axes of the square should 
in any case be mapped onto straight lines. This comparison locates the four singular 
points on the circumference of the circle that correspond to the four corners of the 
square under this special assumption. 

Now the solution of the given problem could evidently be simplified by replacing 
the disk by a half- plane, which can be achieved by the transformation by reciprocal 
radii [inversion, JJG]. Indeed, the resulting simplification lies in the circumstance that 
the boundaries of the two regions to be mapped onto each other are now straight lines. 
By the general law given above, the mapping function can be continued analytically 
outside of the square in which it had been originally defined. If the centre of the 
transformation is taken to be one of the singular points on the circumference of the 
circle, it follows that the points t = oo, f = —1,t = 0, t = +1 on the real axis can 
be taken as singular points, while the half-plane lying on the positive side of the real 
axis is a conformal image of the disk. 

If the position of a point inside the given square is determined by the complex 
number u then the problem requires that for all [such] z, the complex variable t be 
an analytic function for all values of the argument u that correspond to points lying 
in the interior of the given square, with the property that all values corresponding 
to points u on the periphery of the square shall have real values. The domain of 
the argument u can now be extended by the above principle, first to the interior of 
four squares symmetrically placed adjacent to the given square, then by repeated 
application to an arbitrarily large domain of the (u)-plane. 

It follows that the [extended] function ¢ must, for all finite values of its argument 
u in the extension of its domain, be a single-valued function and indeed a doubly 
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periodic function of u, the ratio of whose fundamental periods is ./—1. This already 
indicates the lemniscatic function. 

The boundary of the square has singular points at its four corners. Under the 
requirement that the perimeter of the square is to be mapped onto the perimeter of 
the circle or onto the perimeter of the half-plane so that they are equal in the smallest 
parts [infinitesimally conformal], these points must be excepted; otherwise the given 
problem would contain an impossible condition. 

Each piece of the surface of the square near a corner, being a right-angled sector 
near the vertex, must be mapped onto a straight angle by the function describing the 
given mapping. 

This leads to the problem of finding the most general function that maps the sector 
subtending an angle a7 lying near the vertex u = 0 in the (u)-plane, 


so that inside the given boundaries each point u = re® corresponds continuously to 
a point t = pe’, while the values 


r=0,p=0;¢=0,¥=0;d=ar,v=7 


correspond to each other. The simplest function providing such a mapping is the 
function v = u!/, Every other function t of the argument u that also provides a 
mapping with the stated properties considered as a function of the variable v has, by 
the above theorem, the character of an entire function for the value v = O and in the 
vicinity of this value. 

Likewise conversely the quantity v can be expressed as an analytic function of 
the argument f, which behaves like an entire function for all values of the complex 
quantity ¢ in the vicinity of and including t = 0. Hence one obtains the following 
analytic representations, valid in the vicinities of the values v = 0 and t = 0: 


1 
v=ul/® ¢= Cul t+av+agu? +---);v = —t(1 + dit + byt? +---); 


=c 


u=v°; i i ee ea 
=U; = Ga 1 2 . 
The constant C is positive and different from 0; the coefficients a, b, c all have 
real values; the latter follows from the fact that to all sufficiently small positive values 
of u with respect to the quantity v positive values of t must again correspond. 
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[Schwarz continued and eventually obtained what became knownas the ‘Schwarz— 
Christoffel’ transformation 


r ; dt 
" ee: 


which is a lemniscatic integral that represents the interior of two half-planes divided 
I 
by the real axis onto the interior of a square with sides [ —_“!___ The substitution 
; V4t(1—t2) 


s= = takes one from the half-plane lying on the positive side of the real axis in the 
(t)-plane to the surface of the circle lying in the (s)-plane with radius 1 and centre 
the point s = 0.] 


Appendix B 
Series of Functions 


Very often, functions are defined as convergent infinite sums of other functions, 
and when this is done we can ask if the properties of the individual terms of the sum 
(integrable, continuous, differentiable, ...) also belong to the limit function. The over- 
simplified general philosophy is that they do if the convergence is uniform, otherwise 
not. The troublesome case is differentiability: the terms may be differentiable and 
the convergence uniform, without the limit function being differentiable. When the 
relevant property is assured, the question of term by term integration or differentiation 
arises. 

Two classes of infinite series are generally encountered: power series and Fourier 
series. A power Series has a radius of convergence, and on any smaller interval (or, 
in the complex case, disc) convergence is uniform, so with that restriction on the 
domain of definition, the theorems stated below do apply. 

Fourier series are much more complicated. The ‘standard hypotheses’ on a func- 
tion are that it is periodic (say with period 27), regulated on [—7, 7], and has the 
‘right’ Fourier coefficients: 


TT Tv 
1 1 
a, = — i f(x) cosnxdx, by, = — / f(x) sinnxdx. 
T T 
=. =i 


A regulated function is defined as the uniform limit of step functions. Among the 
regulated functions are functions continuous on a closed interval, say [a, b], and they, 
like all regulated functions, are integrable. A regulated function, f and its indefinite 
integral, F(t) := f Ff (x)dx, satisfy the fundamental theorem of the calculus: 


F'(t) = f(t). 


Even when a function f satisfies the standard hypotheses, sufficient hypotheses for 
the Fourier series of the function f to convergence pointwise to f are that the function 
f be differentiable, and sufficient hypotheses for a Fourier series of a function f to 
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converge uniformly to the function are that the function f be C? (twice continuously 
differentiable). 


Uniform Convergence and Continuity 


A sequence of functions f;, (x) defined on a domain D tends to a function f(x) on 
that domain if for each x € D and every ¢ > 0 there is an N = N(E, x) such that 
n > N implies that | f,(~) — f(x)| < e¢. If N can be chosen for all x € D the 
convergence is said to be uniform, otherwise it is said to be pointwise. 

If a sequence of functions { f,}, each continuous at a point x9 € (a,b), tends 
uniformly to a function f then the function f is continuous at the point xo. 

If a series of functions >*,, fn, each continuous at a point xo, tends uniformly to 
a function f then the function f is continuous at the point xo. 


Uniform Convergence and Integrability 


If aseries of functions >°,, fn, each continuous and Riemann-integrable on an interval 
(a, b), tends uniformly to a function f then the function f is Riemann-integrable 
and the integral of the sum is sum of the integrals; i.e. term by term integration is 
valid. 


Uniform Convergence and Differentiability 


Here, uniformity alone is not enough. Indeed, it is possible for a sequence of differen- 
tiable functions { f,} to converge uniformly on an open interval and for the sequence 
{f,} to fail to converge pointwise on that interval, as we shall see below. However, 
the following more delicate results hold. 


If {fn} is a sequence of functions, each with a finite derivative at every point of the 
open interval (a, b), and: 

If there is at least one point xo in (a, b) such that the sequence { f,(xo)} converges, 
and 

If there is a function g on (a, b) such that the sequence of derived functions { f/} 
converges uniformly to the function g on (a, b), 

Then there is a function f such that the sequence { f,,} converges uniformly to f on 
(a, b), and for each xo in (a, b) the derivative f’(x) exists and f’(x) = g(x). 


If >”, fn is a series of functions, each with a finite derivative at every point of the 
open interval (a, b), and: 

If there is at least one point xo in (a, b) such that the series >”, fn (xo) converges, 
and 

If there is a function g on (a, b) such that the series of derived functions >”, f/ 
converges uniformly to the function g on (a, b), 

Then there is a function f such that the series >”, f, converges uniformly to f on 
(a, b), and for each xg in (a, b) the derivative f’(x) exists and >”, f’(x) = g(x). 
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Non-uniform Convergence of Smooth Functions 


Define f(x) by: 
-1 x<-t 
‘ 1 1 
ee we oe 
1 x> # 


All these functions are differentiable, but the limit is —1 when x < 0,0 when x = 0, 
and +1 when x > 0. 

Indeed, worse can happen. For example, a sequence of functions, (f;,), can tend 
uniformly to a smooth function, f, and yet the sequence of derived functions (f,/) 
can fail to converge even pointwise, and a fortiori therefore there is no question of 


the sequence converging to some limit f’. This is illustrated by the sequence a) 


Suppose, however, that a sequence of functions, (f,) defined on the interval 
(a, b) tends pointwise (not necessarily uniformly) to a smooth function, f, that each 
function /,, is differentiable and has a bounded derivative on (a, b) and the sequence 
of derived functions (f,) converges uniformly to some limit g. Then the conditions 
of the above theorem apply, and we can deduce that the sequence of derived functions 
(f,) converges uniformly to some limit f’, the derivative of f—in other words, term 
by term differentiation is permitted. 

The subtleties are well illustrated by Abel’s series. Recall that f,(x) := 

4-(-D*! ate In the interval (—1, 1), say, these functions do a remarkable 
job of approximating the function x /2 (recall Fig. 4.3). Indeed, they do so well that 
you might imagine that the derived functions 


Fig. B.1_ Much of the tail of Abel’s series 
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a) He OBIT NWl4 ene of! Ill Tok ill ole mira 


-1- 


Fig. B.2 Much of the tail of the derivative of Abel’s series 


fix) = =P (—-1)*! cos(kx) 


approximate the constant function 1/2 equally well—but they do not (recall Fig. 4.4). 
Figure B.1 shows the contribution of terms 101 to 300 to f,(x) — x/2 and Fig. B.2 
shows their contribution to f/(«) — 1/2; they make very visible the fact that the 
convergence of the f,(x) to x/2 is uniform, but that the derived functions do not 
even converge for all x. You need to look carefully to see that in the first case the 
uniform bound is already down to 0.007, whereas in the second case it is still 1. This 
suggests, but does not prove the stated results. 

Note that pointwise convergence of the sequence (/;,) would have been enough 
for the theorem, but even uniform convergence, as here, is not enough to control the 
behaviour of the sequence of derived functions. 

This illustrates a profound moral, that two functions may differ very little and yet 
their derivatives may differ a lot, as is the case with f(x) = Xf (— 1k = sin(kx) and 
x2: 


Appendix C 
Potential Theory: A Mathematical Summary 


The next two subsections revise the mathematics of what was discussed histori- 
cally in Chap. 13. The first covers vector methods (div, grad, and curl). The second 
looks ahead to the connection between potential theory and the theory of harmonic 
functions, which became very important for Riemann’s pioneering development of 
complex function theory. 


Vector Methods 


The calculus of vector fields is concerned with three operators on functions or vector 
fields: grad, div, and curl. To define them we need some notation. We write a function 


u of n variables x1, X2,..., Xn, ASU (x1, X2,..-, Xn), and the ith partial derivative of 

uv as either gu or as Uy,. We write a vector field as v = (v1, v2, ..., Un), where each 
ial . . . 

component v; is a function of the variables x;,x2,...,Xn. 


First, grad applies to a function u and produces a vector: 
grad u = Vu := (heii Usdin Ux,) ; 
Next, div applied to a vector field v produces a scalar: 
div(v) = V-v= vy, + vy, +--+ + Ux,. 


Finally, curl applied to a vector field produces a vector field, but the definition we 
give here (the usual one) only makes sense when there are 3 variables (n = 3)—be 
careful. In this setting, the ith component of the vector field curl v = V x v is 


Ov j Ovg . . : he ie 
Ui oe where the variables are taken in cyclic order i, j, k. 


A major feature of the study of vector fields in the plane or in space is the use of 
line and surface integrals. The main theorems about them, Gauss’s divergence the- 
orem and Stokes’s theorem, generalise the fundamental theorem of the one variable 
calculus. Both concern an arbitrary vector field v in space. 
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Gauss’s divergence theorem concerns a region U of R? inside a closed surface 
S in space, and relates the normal component of v on the boundary S to the integral 
of the divergence of the vector field inside U: 


[vane [vw 


surf vol 


It is about surface and volume integrals. 

Stokes’s theorem concerns a surface in S in R* bounded by a closed curve ¥, 
and relates the integral of the tangential component of v along the boundary to the 
integral of the normal component of the curl of the vector field over the surface S: 


J vas= [ov xven, 


curve surf 


It is about line and surface integrals. 

A potential function associated to a vector field v is a function u such that Vu = v. 
It is an immediate consequence of Stokes’s theorem that in many circumstances the 
line integral of a potential functions depends only on the end points, not the path. It 
is interesting to note that Stokes first stated his theorem as a problem in the Smith’s 
Prize examination at Cambridge in 1854. The Smith’s Prize was an annual event 
designed to test the best recent Cambridge graduates and to give then a chance to 
develop an interest in original research. Technical mastery of the material, which is 
all the proof requires, was regarded a necessary pre-requisite for any kind of further 
study. 


The Gauss—Green Theorem 


A good way to think of the mathematics involved in Gauss’s and Green’s theorems 
is to realise that it all derives from one theorem, which I shall call the Gauss—Green 
theorem. This theorem refers to a region U of R” with a boundary OU on which there 
is aunit outward normal ° = wv, eee vy"). Write the gradient of a function u with 


continuous first derivatives on U as Vu = ou Ou nodes Ou , and the (outward) 
Ox? Oxo OXn 


normal derivative as He =°.Vu 
The Gauss—Green theorem states that 


[usax= [ was, i=1...n. 
U 


OU 


It immediately implies, on integrating wv by parts, the corollary that 


Jrsvdx=— fuvgas+ [wvdias, a Geer 8 


U U OU 
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We now upgrade the functions u, v to have continuous second derivatives, and 


obtain Green’s formulas: 
0 
1. [dudx = [ as 
Ov 
U QU 


Ov 
2. [vesudx == fuavs [was 
Ov 
U OU 


U 


Ov Ou 
3. [wav-vawax= f (w52 0) as. 
U 


OU 


Formula | follows from the Corollary on replacing u with ux, and v with 1. 
Formula 2 follows from the Corollary on replacing v with v,,. 
Formula 3 follows from Formula 2 by interchanging u and v and subtracting. 


Potential and Harmonic Functions 


There is a close connection between potential and harmonic functions, as the next 
result indicates. The function f(r) = ; is the one appropriate to problems involving 
mass, magnetism or electricity. Write this as 


fayD=@?+y? 42)? 


and set R = (x2 + y? + z2)—!/2. We note that OR = —xR? with similar formulae 
for the other partial derivatives, so 


Vf = (xR?, yR?, zR?). 


We now compute the Laplacian of f, Af = V- Vf. We differentiate the first 
component, x R* with respect to x and find 


0 
—— (Ra = 3 RR. 
Ox 

So the Laplacian turns out to be 


a5 Re 8 I Sar 
= 3R? —3(x? +: y+ 22)R° =3R? -3R? =0. 
This computation is valid in any region in which f ~ om, that is, in any region in 


which there is no mass, and we deduce that in any region containing no mass the 
potential function f(r) = i is harmonic—it satisfies Laplace’s equation. 
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This turns out to be generally true: typically a potential function in a region where 
it is never infinite is harmonic. This makes for a very close connection between 
potential theory and the theory of harmonic functions. 

In fact, although it is a matter of vector algebra that V- V x F = 0 for any smooth 
vector field F, and that V x V f = 0 for any smooth function /, both these results 
have non-trivial consequences. In each case, we consider only nice regions of R°— 
for example star-shaped regions—in which nothing becomes infinite. In physical 
terms there are no sources present. In the first case, we can say that V - F = 0 if and 
only if there is a smooth vector field G such that F = V x G. In the second case, we 
can say that V x F = Oif and only if there is a smooth function f such that F = Vf. 
We will prove these claims in a moment. 

Using these results, consider a vector field F which admits a potential function f, 
so F = V f. Now we calculate the Laplacian of the function f, and we find 


Af=V-Vf=V-F, 


and so by the first result Af = 0 if and only if f is the potential function of a vector 
field whose divergence vanishes, which happens if and only if the vector field is a 
curl, that is, it is of the form V x G for some other vector field G. 

We deduce that every potential function, in regions where it is not infinite (no 
sources are present) is a harmonic function. 

We shall now see that in any simply connected domain, if a vector field G = 0 
then there is vector field F such that G = V x F. Let us first fix some notation: 
F = (F\, Fo, F3) and G = (Gj, Go, G3). 

Now we use the fact that for any smooth function f, V x Vf = 0 to simplify 
the problem. We find a function f such that of = —F%, which is easy: define f by 
fMm=- i jf3dz+ any arbitrary function g(x, y). The integration is done treating 
f asa function of z alone. The arbitrary function g can be set to zero or used to keep 
the expressions symmetric when working on a specific problem. 

We are reduced to solving the problem of finding a vector field F of the form 
F = (F}, Fo, 0). The equations we have to solve are therefore these three 


Fy, = Gi, 
—F), = Go, 
Pi; — Fy, = G3. 


This is three equations for two unknowns, so the system is said to be over-determined 
and perhaps cannot be solved. Notice that they imply that G1, + G2, + G3, = 0, 
which is just the condition that V - G = 0, so the given condition is necessary; we 
are about to see that it is sufficient. 

We integrate the first equation with respect to z and find 


r 
F,(r) = | Gide hee, ai, 
0 
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where f(x, y) is an arbitrary function of x and y. We substitute this into the third for 
equation, and get these two equations for the remaining component F| of the vector 
field F: 


r 
Fi,(%) = | Gidz+ G3 +h, y), 
0 


This is two equations for F'. We solve the second one by a simple integration, to get 


r 
AW = -| Gudea: 
0 


where k(x, y) is another arbitrary function of x and y. Then we differentiate this 
expression for F with respect to y and substitute that in the other equation, to get 


r 


r 
Fy) = f Go,dz+ky(x.y) = f Gidz+G3+h,(x, y). 
0 0 


This rearranges to become 


kor») = hstesy) = f (G2, — Gi) — G3. 


To finish off we can set one of / or k to zero and integrate. In any given problem 
it is likely that one or other of these methods will make the integration easy or the 
expression for F attractive. But in any case we have shown that if a vector field 
G = 0 then there is vector field F such that G = V x F. The only condition is 
that the path integrals make sense, and this is guaranteed in IR? or any other simply 
connected space. 

Next, let us be more precise about when a vector field F is the gradient of a 
(potential) function. Notice that this is the case if and only if the line integral of the 
field is independent of the path. ‘Only if’ goes this way: we have Lm F.dr = 1s V f.dr 
and this works out, by the chain rule, to be the integral of df along the path, which 
is plainly the difference in the values of f at the endpoints whatever path is chosen. 
The ‘if’ part is little more than writing down f(x) := i F.dr for an arbitrary p and 
checking that it makes sense—which it does because the integral is assumed to be 
independent of the path. 

So a vector field is the gradient of a function if and only if integrals along it are 
independent of the path, which is the same—by Stokes’s theorem—as saying if and 
only if integrals around closed loops always enclose a surface within which the curl 
V x F vanishes. And we know that the vanishing of the curl of F is a necessary 
condition for F to be given by a potential function. So a vector field is given by a 
potential function if and only if any closed loop in the regions outside the sources of 
the field contains a surface. This is usually true in questions about gravitation, but 
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not in electro-magnetic theory where circuits flow along closed loops of wire or, it is 
sometimes supposed, along infinitely long wires. Later mathematicians (and indeed 
Gauss) were to see this as the start of the study of the topology of spaces of the form 
R? — stuf f. 
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